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Preface 


This book presents a series of research and expository articles under the broad 
banner of Analytic Number Theory. Written by eminent mathematicians from the 
international mathematical community, the chapters in this book are dedicated 
to the 60th birthday of Professor Helmut Maier. His own research has been 
groundbreaking and deeply influential, especially in the problem of gaps between 
consecutive primes. 

Helmut Maier was born in Geislingen/Steige, Germany, on October 17, 1953. 
He graduated with a Diploma in Mathematics from the University of Ulm in 1976, 
under the supervision of Professor H.E. Richert. After a period as a scientific 
employee at the University of Ulm, he started his doctoral studies at the University 
of Minnesota in 1979. He received his Ph.D. in 1981 with the thesis entitled 
Some results on prime numbers based on the application of sieve theory, under the 
supervision of Professor J. Ian Richards. The thesis was an extension of his paper 
Chains of large gaps between consecutive primes, which appeared in Advances in 
Mathematics, 39 (1981), 257-269. In this paper, Maier applied for the first time 
what is now called Maier’s Matrix Method. This method later on led him and 
other mathematicians to the discovery of unexpected irregularities in the distribution 
of prime numbers. The Matrix Method was also applied to other questions like 
irreducible polynomials and consecutive primes in the same residue class modulo 
an integer. 

After postdoctoral positions at the University of Michigan and the Institute for 
Advanced Study, Princeton, Maier then took a permanent position at the University 
of Georgia. During this period, he showed that the usual formulation of the Cramér 
model for the distribution of primes is wrong, a completely unexpected result. With 
C. Pomerance, he did research on the values of Euler’s y-function and large gaps 
between primes. One theme in this period was Professor Maier’s work on the size of 
the coefficients of cyclotomic polynomials, done partially in collaboration with S. 
Konyagin and E. Wirsing. He also collaborated with H.L. Montgomery on the size of 
the sum of the Mobius function under the assumption of the Riemann Hypothesis. 
Maier and G. Tenenbaum in joint work investigated the sequence of divisors of 
integers, solving the famous propinquity problem of Erdés. 


vi Preface 


Since 1993, Maier is a Professor at the University of Ulm, Germany. Lately, 
M.Th. Rassias and Maier worked on problems related to the Estermann zeta 
function and the Nyman-—Beurling criterion on the Riemann Hypothesis involving 
the distribution of certain cotangent sums. In their latest work, they studied prime 
avoidance of perfect powers of prime numbers. 

Other collaborators of Professor Maier include P. Erdés, J. Friedlander, 
A. Granville, D. Haase, A.J. Hildebrand, M.L. Lapidus, J.W. Neuberger, 
A. Sankaranarayanan, A. Sarkézy, and C.L. Stewart. 

We are grateful to all of the mathematicians who participated in this publication. 

We also wish to express our thanks to Springer for undertaking the publication 
of this book. 


Hanover, NH, USA Carl Pomerance 
Princeton, NJ, USA Michael Th. Rassias 
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CM-Points on Straight Lines 


Bill Allombert, Yuri Bilu, and Amalia Pizarro-Madariaga 


To Helmut Maier on his 60th anniversary 


Abstract We prove that, with “obvious” exceptions, a CM-point (j(t1),j(t2)) 
cannot belong to a straight line in C? defined over Q. This generalizes a result of 
Kiihne, who proved this for the line x; + x. = 1. 


1 Introduction 


In this article t with or without indices denotes a quadratic! complex number with 
Imt > 0 and j denotes the j-invariant. 

In 1998 André [1] proved that a non-special irreducible plane curve in C? may 
have only finitely many CM-points. Here a plane curve is a curve defined by 
an irreducible polynomial equation F(x),x2) = 0, where F is a polynomial with 
complex coefficients. A CM-point in C? is a point of the form (j(t;),j(t2)) with 
quadratic tT, t2. Special curves are the curves of the following types: 


“Quadratic” here and below means “of degree 2 over Q ”. 
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¢ “vertical lines” x; = j(t); 

¢ “horizontal lines” x2. = j(t2); 

¢ modular curves Yo(N), realized as the plane curves ®y(x1,x2) = 0, where ®y is 
the modular polynomial of level N. 


Clearly, each special curve contains infinitely many CM-points, and André 
proved that special curves are characterized by this property. 

André’s result was the first non-trivial contribution to the celebrated André- 
Oort conjecture on the special subvarieties of Shimura varieties; see [11] and the 
references therein. 

Independently of André the same result was also obtained by Edixhoven [7], but 
Edixhoven had to assume the Generalized Riemann Hypothesis for certain L-series 
to be true. 

Further proof followed; we mention specially the remarkable argument of Pi- 
la [10]. It is based on an idea of Pila and Zannier [12] and readily extends to higher 
dimensions [11]. 

The arguments mentioned above were non-effective, because they used the 
Siegel-Brauer lower bound for the class number. Breuer [5] gave an effective proof, 
but it depended on GRH. 

Recently Kiihne [8, 9] and, independently, Bilu et al. [3] found unconditional 
effective proofs of André’s theorem. Besides giving general results, both articles [9] 
and [3] treat also some particular curves, showing they have no CM-points at all. 
For instance, Kiihne [9, Theorem 5] proves the following. 


Theorem 1.1. The straight line x, + x. = 1 has no CM-points. 


(The same result was also independently obtained in an earlier version of [3], but 
did not appear in the final version.) 

A similar result for the curve x;x. = 1 was obtained in [3]. 

One can ask about CM-points on general straight lines defined over Q; that is, 
defined by an equation 


Aix; + Arx2 + B= 0, (1) 


where Aj, Az, B € Q. One has to exclude from consideration the special straight 
lines: x; = j(™|), x2 = j(t2) and x; = x2, the latter being nothing else than the 
modular curve Yo(1) (the modular polynomial ®; is x; — x2). According to the 
theorem of André, these are the only straight lines containing infinitely many CM- 
points. 

In the present paper we obtain a rather vast generalization of Theorem 1.1. 


Theorem 1.2. Let (j(t1),j(t2)) be a CM-point belonging to a non-special straight 
line defined over Q. Then we have one of the following options. Either 


Hu), (2) € Q (2) 
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or 


it) FI), Qa) = VMU()), (AG) : A = [QG(~)) : Q| = 2. 
(3) 


Remark 1.3. 1. Recall that [QG(t) : Q] = h(G,), the class number of the “com- 
plex multiplication order” @, = End(t, 1), where (t, 1) is the lattice generated 
by t and 1. All orders of class number | and 2 are well known, which means 
that points satisfying (2) or (3) can be easily listed. In fact, there are 169 CM- 
points satisfying (2) and, up to Q-conjugacy, 217 CM-points satisfying (3); see 
Remark 5.3 for the details. 

2. Our result is best possible because any point satisfying (2) or (3) does belong to 
anon-special straight line defined over Q. 

3. Kiihne remarks on page 5 of his article [9] that his Theorem 4 allows one, in 
principle, to list all possible CM-points belonging to non-special straight lines 
over Q, but the implied calculation does not seem to be feasible. 

4. Bajolet [2] produced a software package for finding all CM-points on a given 
straight line. He illustrated its efficiency by proving that no straight line (1) with 
non-zero Aj, Ao, B € Z satisfying |A;|, |A2|, |B] < 10 passes through a CM-point. 
This work is now formally obsolete because of our Theorem 1.2, but a similar 
method can be used in more general situations, where our theorem no longer 
applies. 

5. CM-points (x1,x2) satisfying x)x. € Q* are completely classified in [4]; this 
generalizes the above-mentioned result from [3] about the curve x;x2 = 1. 


In Sects.2 and 3 we recall basic facts about imaginary quadratic orders, class 
groups, ring class fields and complex multiplications. 

In Sect. 4 we investigate the field equality QG(t1)) = QGU(t)). In particular, in 
Corollary 4.2 we determine all cases of such equality when Q(t) # Q(t). This 
might be of independent interest. 

After all these preparations, we prove Theorem 1.2 in Sect. 5. 


2 Imaginary Quadratic Orders 
In this section we recall basic facts about imaginary quadratic fields and their orders, 
and recall a famous result of Weinberger about class groups annihilated by 2. 


2.1 Class Groups 


Let K be an imaginary quadratic field and @ an order in K of discriminant 
A= Df?, where D is the discriminant of the field K (often called the fundamental 
discriminant) and f is the conductor of @, defined from @ = Z + fOx. We denote 
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by Cl(@) the class group of @ (the group of invertible fractional ideals modulo the 
invertible principal fractional ideals). As usual, we set h(@) = |Cl(@)|. Since @ is 
uniquely determined by its discriminant A, we may also write Cl(A), h(A), etc. In 
particular, Cl(D) = Cl(@x) is the class group of the field K, and h(D) is the class 
number of K. 

There is a canonical exact sequence 


1 > Clo(A) > Cl(A) > Cl(D) > 1, (4) 


where the kernel Clo(A) will be described below. This implies, in particular, that 
h(D) | h(A). 

The structure of the group Clo(A) is described, for instance, in [6], Sect. 7.D and 
Exercise 7.30. We briefly reproduce this description here. We will assume, with a 
slight abuse of notation, that Z/fZ is a subring of Ox/fOx. Then we have another 
canonical exact sequence 


1 — (Z/fZ)" (Og) > (Ox/f Ox)” > Clo(A) > 1, (5) 


where (@7 )+ is the image of the multiplicative group Of in (Ox/f Gx). 
The group (Z/fZ)*(@x)¢ is “not much bigger” than (Z/fZ)*. Precisely, 


2 ifD=-4,f >1, 
[(Z/fZ)* (Og )p : (Z/fZ)"| = [Og : O*] = 43 if D=-3,f > 1, 


1 otherwise. 


An easy consequence is the following formula for (A): 
pot (5)") 
h(A) = 1-|{-— ; (6) 
[OR : | II P 


where (D/-) is the Kronecker symbol. 


2.2. Orders with Class Groups Annihilated by 2 


In this subsection we recall the famous result of Weinberger about imaginary 
quadratic orders whose class group is annihilated by 2. For a multiplicatively written 
abelian group G we denote by G? its subgroup of squares: G? = {g” : g € Gt. 

The group Cl(A)/Cl(A)? is usually called the genus group of A. It is known to 
be isomorphic to (Z/2Z)", where jz = (A) € {@(A) — 1, w(A)} and w(-) denote 
the number of distinct prime divisors. We may also remark that w(A) = w(A) — 1 
when A = D (and f = 1). 
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Already Euler studied discriminants A with the property 
|C1(A)?| = 1, (7) 


or, equivalently, Cl(A) = (Z/2Z)”. (Of course, he used a different terminology.) 
Chowla proved that the set of such A is finite. Using a deep result of Tatuzawa [13] 
about Siegel’s zero, Weinberger [16] improved on this, by showing that field dis- 
criminants D with this property are bounded explicitly with at most one exception. 
To state Weinberger’s result precisely, denote by D’ the square-free part of D: 


j D, D=1 mod 4, 


~ )D/4, D=0mod 4. 


Proposition 2.1 (Weinberger [16], Theorem 1). There exists a negative inte- 
ger D* such that for any discriminant D of an imaginary quadratic field the property 
|Cl(D)?| = 1 implies |D’| < 5460 or D = D*. 


Using existing methods [15], it is easy to determine the full list of D with 
|Cl(D)?| = 1 and |D’| < 5674. Since the group Cl(D) is a quotient of the group 
Cl(Df?), if the latter has property (7) then the former does. Also, for every given D 
it is easy to find all possible f such that |Cl(Df)*| = 1, using the description of the 
group Cl(Df”) given in Sect. 2.1. Hence the couples (D,f) for which |Cl(Df?)?| = 1 
and |D’| < 5674 can be easily listed as well. This list is widely available in the 
literature since long ago; we reproduce it in Table 1. 

It follows that Weinberger’s result has the following consequence. 


Corollary 2.2. There exists a negative integer D* such that |C\(Df)”| = 1 implies 
that either A = Df? appears in Table 1 or D = D*. 


Table 1 Known A with |Cl(A)?| = 1 

h(A) = 1| -3,-3-2?,-3- 3”, -4, -4- 27, -7,-7-2?, -8, -11, -19, —43, —67, -163 

h(A) = 2| —3- 47, -3-5?, -3- 7°, -4- 37, -4-4?, 4-5? -7-4?, -8-2?, -8 - 3, -11 -3?, 
15, -15- 2?, —20, —24, -35, —40, —51, —52, -88, —91, —115, —123, —148, 
187, —232, —235, —267, —403, —427 

h(A) > 4| —3- 82, -7-8?,-8- 62, -15- 47, -15- 82, —20- 3”, -24- 2?, -35 - 3, —40 - 2?, 
84, —88 - 2?, 120, —120 - 2?, -132, 168, —168 - 2?, —195, —228, —232 - 2?, 
280, —280 - 2?, —312, —312 - 27, -340, —372, —408, —408 - 2?, —420, —435, 
483, —520, —520 - 27, —532, —555, —595, —627, —660, —708, —715, 
760, —760 - 27, —-795, —840, —840 - 2”, —1012, —1092, —1155, —1320, —1320- 2?, 
1380, —1428, —1435, —1540, —1848, —1848 - 2?, 1995, —3003, —3315, —5460 
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Remark 2.3. Class numbers of discriminants from Table | are at most 16, and the 
results of [15] imply that Table 1 contains all A with |CI(A)?| = 1 and h(A) < 64. 
Hence if A satisfies |CI(A)?| = 1 but does not appear in Table 1 then we must have 
h(A) = 128. 

In particular, the first two lines of Table | give full lists of negative quadratic 
discriminants A with h(A) = 1 and 2. 


3 Ring Class Fields and Complex Multiplication 


Let K be an imaginary quadratic field, and @ an order in K of discriminant A = Df?. 
One associates with @ an abelian extension of K with Galois group Cl(@), called the 
ring class field of @. We will denote it by RiCF(@), or RiCF(A), or RiCF(K,f). 
The canonical isomorphism Cl(@) — Gal(RiCF(@)/K) is called the Artin map. 
For the details see, for instance, [6, Sect. 9]. 

The correspondence @ < RiCF(@) is functorial in the following sense: if @” is a 
sub-order of @, then RiCF(@”) C RiCF(@), and we have the commutative diagram 


Cl(@) > Gal(RiCF(@)/K) 


1 1 
Cl(6’) > Gal(RiCF(0’)/K) 


where the horizontal arrows denote Artin maps and the vertical arrows are the 
natural maps of the class groups and the Galois groups. It follows that the Galois 
group of RiCF(@) over RiCF(@”) is isomorphic to the kernel of Cl(@) > Cl(@”). 
In particular, Gal(RiCF(@) /RiCF(@x)) is Clo(A), the group introduced in Sect. 2.1. 
(One may notice that RiCF(@x) is nothing else but the Hilbert class field of K.) 


3.1 Compositum of Ring Class Fields 


In this subsection it will be more convenient to use the “conductor notation” 
RiCF(K,f). 

As we have seen above, if fi | f then RiCF(K, fi) C RiCF(K,f). It follows that 
the compositum of two ring class fields RiCF(K,f\) and RiCF(K,f2) is a subfield 
of RiCF(K,f), where f = LCM(/i, 2). It turns out that this compositum is “almost 
always” equal to RiCF(K,f), but there are some exceptions. Here is the precise 
statement. It is certainly known, but we did not find it in the available literature. 


Proposition 3.1. Let K be an imaginary quadratic field of discriminant D and f,, fr 
positive integers. Set f = LCM(fi, fz). Then we have the following. 
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1. If D 4 —3, —4 then RiCF(K, f, RiCF(K, f:) = RiCF(K,f). 

2. Assume that D € {—3 — 4}. Then RiCF(K,f, )RiCF(K, 2) = RiCF(K,f) either 
when one of fi,f2 is 1 or when gcd(fi,f2) > 1. On the contrary, when 
fist. > 1 and gcd(fi,f2) = 1, the compositum RiCF(K, f,)RiCF(K, f2) is a sub- 
field of RiCF(K, f) of degree 2 for D = —4 and of degree 3 for D = —3. 


Proof (A Sketch). To simplify the notation, we set 


Lo = RiCF(K, 1), L, = RiCF(K, fi), Ly = RiCF(K, fy), 
L = RiCF(K,f), Uo =Lb. 


The mutual position of these fields is illustrated here: 


We want to determine the degree [L: L’]. 
We have Gal(L/Lo) = Clo(Df?). By (5), this implies 


Gal(L/Lo) = (@x/fOx)* /(Z/fZ)* (GR), 
Similarly, 
Gal(L;/Lo) = (@x/fi@x)*/(Z/fiZ)* (Og), = i= 1,2). 
The Galois group Gal(L/L;) is the kernel of the natural map 
(Ox [fOx)* |(Z/fZ)" (Og )¢ > (Ox [On |(Z/fZ)* (Op 

Hence Gal(L/L’) is the common kernel of the maps 7 and z». It follows that 
Gal(L/L') = G/(Z/fZ)*(@x),, where G is the subgroup of (Cx /fOx)* consisting 
of x € (Ox/f Gx)™ satisfying 

x € (Z/[fZ)(Og)y mod fi (i= 1,2). (8) 


In particular, [L : L'] = [G: (Z/fZ)*(@z)s]. 
If D ¢ —3, —4 then OF = {+1}, which implies that 


G = (Z/fZ)* (Ox) = (Z/fZ)* 


and L = L’. 
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Now assume that D = —4. Then OF = {+1,4V—1}. When fi = 1 or fp = 1 
the statement is trivial, so we may assume that f|,f. > 1. Condition (8) can be re- 
written as 


x € Z/fZU (Z/fZ)V—-1U ((Z/fDfi + (Z/fDV—V) UV ((Z/fD fp + (Z/fDfiv—1). 9) 


If gcd(f,,f2) > 1, then the last two sets in (9) have no common elements with 
(Ox/fCx)*. We obtain 


G = (Z/fZU (Z/fZ)V-1) 0 (Gx/fOx)* = (Z/fZ)(OR)y, 


and L = L’. 

If gcd(fi,f2) = 1, then each of the last two sets in (9) has elements belonging 
to (Ox/fCx)* but not to (Z/fZ)(Cx),; for instance, fi +hv-1 and fp +f, /—1, 
respectively (here we use the assumption f1,f2 > 1). Hence [G : (Z/fZ)(@z)¢] > 1. 
On the other hand, if x and y belong to the last two sets in (9), then xy € Z/fZ if 
they belong to the same set, and xy € (Z/fZ)/—1 if they belong to distinct sets. 
This shows that [G : (Z/fZ)(@x)¢] = 2, and hence [L : L'] = 2. This completes the 
proof in the case D = —4, 

The case D = —3 is treated similarly. We omit the details. oO 


3.2. Complex Multiplication 


Ring class fields are closely related to the Complex Multiplication. Let t € K with 
Imt > 0 be such that @ = End(t, 1) (where (z, 1) is the lattice generated by t 
and 1); one says that @ is the complex multiplication order of the lattice (t, 1). 

The “Main Theorem of Complex Multiplication” asserts that j(t) is an 
algebraic integer generating over K the ring class field RiCF(@). In particular, 
[K(j(t)) : K] = h(@). In fact, one has more: 


[KG(c)) : K] = [QG()) : Y = A(@). (10) 


The proofs can be found in many sources; see, for instance, [6, Sect. 11]. 

Since RiCF(A) = K(j(t)) is a Galois extension of K, it contains, by (10), all 
the Q-conjugates of j(t). It follows that K(j(t)) is Galois over Q; in particular, the 
Galois group Gal(K/Q) acts on Gal(K(j(t))/K). 


Proposition 3.2. The Galois group Gal(K/Q) acts on Gal(K(j(t))/K) “dihe- 
drally”: if «is the non-trivial element of Gal(K/Q), then we have o' = a7! for 
any o € Gal(K(j(t))/K). 


For a proof see, for instance, [6, Lemma 9.3]. 


CM-Points on Straight Lines 9 


Corollary 3.3. The following properties are equivalent. 


1. The field K(j(t)) is abelian over Q. 

2. The Galois group Gal(K(j(t))/K) is annihilated by 2 (that is, isomorphic to 
Z/2Zx-+++x Z/2Z). 

3. The Galois group Gal(K (j(t))/Q) is annihilated by 2. 

4. The field Q(j(t)) is Galois over Q. 

5. The field Q(Gj(t)) is abelian over Q. 


Proof. The implications 1=2=>3 follow from Proposition 3.2. The implica- 
tion 3=4 is trivial. To see the implication 4=5, just observe that (10) implies 
the isomorphism Gal(K(j(t))/K) = Gal(Q(Gj(t))/Q). Finally, the implication 5=> 1 
is again trivial. Oo 


3.3. The Conjugates of j(t) 


Let t and @ be as in Sect. 3.2, and let A be the discriminant of @. As we already 
mentioned in the beginning of Sect. 3.2, j(t) is an algebraic integer of degree h(A). 
It is well known that the Q-conjugates of j(t) can be described explicitly. Below we 
briefly recall this description. 

Denote by T = T, the set of triples of integers (a, b, c) such that 


gcd(a,b,c) = 1, A=b’ —4ac, 


either -—a<b<a<c or 0<b<a=c 


Proposition 3.4. All Q-conjugates of j(t) are given by 


(7). (a,b,c) € Ta. (11) 


In particular, h(A) = |Ta\. 


For a proof, see, for instance, [6, Theorem 7.7]. 
The following observation will be crucial: in the set T, there exists exactly one 
triple (a, b,c) with a = 1. This triple can be given explicitly: it is 


(1, r4(A), w=") 


where r4(A) € {0, 1} is defined by A = r4(A) mod 4. The corresponding number 
j(t), where t = (—r4(A) + /A)/2, will be called the dominant j-value of discrim- 
inant A. It is important for us that it is much larger in absolute value than all its 
conjugates. 
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Lemma 3.5. Let j(t) be the dominant j-value of discriminant A, with |A| > 11, 
and let j(t') 4 j(t) be conjugate to j(t) over Q. Then |j(t')| < 0.1[j(t)}. 

Proof. Recall the inequality ||j(z)| — |g_'|| < 2079, where g, = e?"", for z belong- 
ing to the standard fundamental domain of SL2(Z) on the Poincaré plane [3, 
Lemma 1]. We may assume that t = (—r4(A) + V/A)/2 and 1’ = (—b + V/A) /2a 
with a > 2. Hence |g,| = e7V/4I > e7¥™ > 33506 and |qz7| < |gz|!/2. We obtain 


We’) — Igel/? + 2079 _ 335061/? + 2079 


0.1, 
li(t)| ~ |qr]— 2079 ~ 33506 — 2079 = 


as wanted. oO 


The minimal polynomial of j(t) over Z is called the Hilbert class polynomial* of 
discriminant A; it indeed depends only on A because its roots are the numbers (11). 
We will denote it Hy (x). 


4 Comparing Two CM-Fields 


In this section we study the field equality Q((t1)) = QU(t)). We distinguish two 
cases: Q(t) 4 Q(t), when we obtain the complete list of all possibilities, and 
Q(t) = Q(t), where we will see that A, and A> are “almost the same.” Here for 
i= 1,2 we denote by A; the discriminant of the “complex multiplication order” 
O; = End(z;, 1), and write A; = Dif? with the obvious meaning of D; and fj. 


4.1 The Case Q(t) # Q(t2) 


In this subsection we investigate the case when Q(j(t,)) = Q(j(72)), but the fields 
Q(t) and Q(t) are distinct. It turns out that this is a very strong condition, which 
leads to a completely explicit characterization of all possible cases. 


Theorem 4.1. Let t, and t, be quadratic numbers such that Q(G(t)) = QG(t)), 
but Q(t) 4 Q(t). Then both A, and A, appear in Table 1. 


Proof. Denote by L the field Q(G(t1)) = Q(Gj(t)). If L is a Galois extension of Q, 
then the group Cl(A,) = Cl(A;) is annihilated by 2 by Corollary 3.3, and we can 
use Corollary 2.2. Since D; 4 D2, at least one of the two discriminants D; and Dy 


Or, sometimes, ring class polynomial, to indicate that its root generates not the Hilbert class field, 
but the more general ring class field. 
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is distinct from D*; say, D} 4 D*. Then A, is in Table 1. Since h(A,) = h(A2), 
Remark 2.3 implies that A, is in Table | as well. (This argument goes back to 
Kiihne [9, Sect. 6].) 

Now assume that 


Lis not Galois over Q. (12) 


We will show that this leads to a contradiction. Denote by M the Galois closure of L 
over Q; then, M = Q(t, j(t1)) = Q(t, j(t)). Define the Galois groups 


G = Gal(M/Q), N = Gal(M/Q(t1, t2)), 
N; = Gal(M/Q(t;)) = Cl(A;) (= 1,2), 


so that N = N; N.Np and [N, : N] = [Nz : N] = 2. 
We claim the following: 


the group N is annihilated by 2. (13) 


(One may mention that this is a special case of an observation made independently 
by Edixhoven [7] and André [1].) 

Indeed, let 1; be an element of G acting non-trivially on 1 but trivially on 1p. 
Then for any o € N; we have o! = a! by Proposition 3.2. On the other hand, for 
any o € No we have o"! = o. It follows that o = o! foro € N, proving (13). 

Thus, each of the groups N; and N has a subgroup of index 2 isomorphic to 
(Z/2Z)" for some integer 4. Hence each of N; and N> is isomorphic either to 
(Z/2Z)#*! or to Z/4Z x (Z/2Z)#}. 

If, say, N; & (Z/2Z)#*!, then L is Galois over Q by Corollary 3.3, contradict- 
ing (12). Therefore 


N, & Ny & Z/4Z x (Z/2Z)#!. 


Let « € G be the complex conjugation. Since 7 extends the non-trivial element 
of Gal(Q(t;)/Q), the group H = {1,1} acts on N, dihedrally: of = 07! for 
o € N,, and we have G = N, » H. Let N; and N/’ be subgroups of N; isomorphic 
to Z/4Z and (Z/2Z)"~', respectively, such that N} = Nj x Ni’. Then H commutes 
with N/’ and acts dihedrally on N;. Hence 


G=N,™“H =(N, xH) x Ni & Dg x (Z/2Z)""', 


where D>, denotes the dihedral group of 2n elements. 

Now observe that Dg x (Z/2Z)#~! has only one subgroup isomorphic to the 
group Z/4Z x (Z/2Z)"—'; this follows, for instance, from the fact that both groups 
have exactly 2” elements of order 4. Hence N; = N2, which implies the equality 
Q(t) = Q(t), a contradiction. oO 
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Now one can go further and, inspecting all possible pairs of fields, produce the 


full list of number fields presented as Q(j(t1)) and Q(j(t2)) with Q(t) # Q(t). 


Corollary 4.2. Let L be a number field with the following property: there exist 


quadratic tT and t, such that L = Q(G(t1)) = QG(t2)) but Q(11) F Q(t). Then L 
is one of the fields in Table 2. 


Table 2 Fields presented as Q(j(t;)) and Q(j(t2)) with Q(t) F Q(t) 


Field L [L: Q]|/ A Cl(A) 
Q 1 3, —4, -7, -8, -11, —12, —16, —19, —27, —28, —43, | trivial 
—67, —163 

Q(v2) 2 —24, —32, —64, —88 Z/2Z 
Q(v3) 2 —36, —48 Z/2Z 
Q(/5) 2 15, —20, —35, —40, —60, —75, -100,-115,-235 | Z/2Z 
Q(/13) 2 —52, —91, —403 Z/2L 
Q(V/17) 2 —51,—187 Z/2Z 
Q/2, V3) 4 —96, —192, —288 (Z/2Z) 
Q(v3, V5) 4 —180, —240 (Z/2Z) 
Q(/5, V13) | 4 —195, —520, —715 (Z/2Z) 
Q(/2, V5) 4 120, —160, —280, —760 (Z/2Z) 
Q/5, 717) | 4 —340, —595 (Z/2Z)? 
Q/2, V3, /5)) 8 —480, —960 (Z/2Z)3 
Explanations: 


1. the third column contains the full list of discriminants A of CM-orders End(t, 1) such that 
L=QG(r)); 


2. the fourth column gives the structure of the class group Cl(A) for any such A. 


Proof. This is just a calculation using PARI. Oo 


4.2 The Case Q(t1) = Q(t) 


Now assume that Q(U(t1)) = Q(G(m)) and Q(t1) = Q(t). Denote by D the 
discriminant of the number field Q(t;) = Q(t) and write A; = f7D for i = 1,2. 


Proposition 4.3. Assume that QG(t1)) = QU(m)) and Q(t) = Q(t). Then 


either 
filfe € {1,2,1/2} (14) 


or D = —3 and fi, fo € {1, 2, 3} (in which cases Q(j(™))) = QG(tm)) = Q). 
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Proof. Put f = LCM(f,,f2). When D 4 —3, —4, Proposition 3.1 implies that 
h(f’D) = [QD j(t1),j(t2)) : QD) (15) 
Since j(t1) and j(t2) generate the same field, we obtain 
A(f?D) = h(f;D) = h(f?D). (16) 


Using (6) and (16), we obtain 


RUG-(F)o")=5 


ptf 


which implies that f/f; € {1,2}. Similarly, f/f € {1,2}. Hence we have (14). 

Now assume that D € {—3, —4}. If gcd(f,, 2) > 1, then we again have (15), and 
the same argument proves (14). 

If, say, fj = 1 then either D = —4 and fp € {1,2}, in which case we again 
have (14), or D = —3 and fa € {1, 2, 3}. 

Finally, assume that f\,f2 > 1 and gcd(fi,f2) = 1. Then f = fif2 and Proposi- 
tion 3.1 implies that 


h(f7D) = h(f;D) = €'h(f?D), (17) 
where £ = 2 for D = —4 and £ = 3 for D = —3. Using (6) and (17), we obtain 
D 
fil] (1 = (2) >} =f (=1,2), 
Pili p 


and a quick inspection shows that in this case D = —3 and {f1,f2} = {2, 3}. Oo 


5 Proof of Theorem 1.2 


We assume that P = (j(t1),j(t2)) belongs to a non-special straight line £ defined 
over Q, and show that it satisfies either (2) or (3). We define A; = Dif? as in the 
beginning of Sect. 4. 

Let Ax; + Aox. + B = 0 be the equation of . Since @ is not special, we have 
A\A2 # 0, which implies, in particular, that 


QG(1)) = QG()). (18) 


We set 


L=QG(t1)) = QG(@)), k= h(A)) = h(A2) = [L: Q. 
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If j(t1), j(t2) € Q (that is, h = 1), we are done. From now on assume that 


J(t1),j(t2) € Q. 


If j(t,) = j(t), then £ is the special line x; = x2, because it passes through the 
points (j(t;),j(t1)) and through all its conjugates over Q. Hence 


its) F jt). (19) 
If h = 2, then we have (3). From now on assume that 
h> 3, (20) 
and, in particular, 
|Ai|,|A2| > 23. (21) 
We will show that this leads to a contradiction. 


Remark 5.1. Before proceeding with the proof, remark that, for a given pair of 
distinct discriminants A, and Ap it is easy to verify whether there exists a point 
(j(t),j(t2)) on a non-special straight line defined over Q, such that A; is the 
discriminant of the CM-order End(t;, 1). Call two polynomials f(x), g(x) € Q[x] 
similar if there exist a, B, A € Q with aA # 0 such that f(ax + B) = Ag(x). Now, a 
point (j(t,),j(t2)) as above exists if and only if the class polynomials Hy, and H, 
(see end of Sect. 3.3) are similar. This can be easily verified using, for instance, the 
PARI package. 


5.1 Both Coordinates are Dominant 


It turns out that we may assume, without a loss of generality, that both j(t1) and j(t2) 
are the dominant j-values of corresponding discriminants, as defined in Sect. 3.3. 


Lemma 5.2. Assume that € is a non-special straight line containing a CM- 
point P = (j(t%),j(t%)) satisfying (18)-(20). Then £ contains a CM-point 
P’ = (j(t}),j()), conjugate to P over Q and such that j(t/) is the dominant 
j-value of discriminant A, for i = 1, 2. 


Proof. Since £ is defined over Q, all Q-conjugates of P belong to ¢ as well. 
Replacing P by a Q-conjugate point, we may assume that j(t,) is the dominant 
j-value for the discriminant A). If j(t2) is the dominant value for A. we are done; 
so assume it is not, and show that this leads to a contradiction. 

Since both j(t;) and j(t2) generate the same field of degree h over Q, the Galois 
orbit of P (over Q) has exactly h elements; moreover, each conjugate of j(t,) occurs 
exactly once as the first coordinate of a point in the orbit, and each conjugate of j(t2) 
occurs exactly once as the second coordinate. 
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It follows that there is a conjugate point P° such that the second coordinate 
J(t)° is the dominant j-value for A>; then, its first coordinate j(t,)° is not dominant 
for A, because P? # P. Since h > 3, there exists yet another point P” with both 
coordinates not dominant for the respective discriminants. 

All three points P, P’, and po’ belong to £. Hence 


1 j(t1) f(t) 
1 j(11)" j(2)° | = 0. 
1 (m1) f(t)? 


The determinant above is a sum of 6 terms: the “dominant term” j(t))j(t2)° and 5 
other terms. Each of the other terms is at most 0.1|j(t1)j(t2)°| in absolute value: this 
follows from Lemma 3.5, which applies here due to (21). Hence the determinant 
cannot vanish, a contradiction. oO 


5.2. Completing the Proof 


After this preparation, we are ready to complete the proof of Theorem 1.2. Thus, let 
P = (j(t),j(t2)) belong to a straight line 2 defined over Q. We assume that (18)— 
(20) are satisfied, and we may further assume that j(t;), j(t2) are the dominant 
j-values of A, A>, respectively. 

If Q(t) Q(t), then Corollary 4.2 applies, and all possible L, A,, and A» 
can be found in Table 2. In particular, we have only 6 possible fields L and 15 
possible couples A,, A>. All of the latter are ruled out by verifying (using PARI) 
that the corresponding Hilbert class polynomials H,, (x) and H,, (x) are not similar, 
as explained in Remark 5.1. 

If Q(t) = Q(t), then Proposition 4.3 applies, and we have fi /f: € {1,2, 1/2}. 
Since both j(t,) and j(t2) are dominant, the case f; =f) is impossible: there is 
only one dominant j-value for every given discriminant, and we have j(t,) 4 j(t2) 
by (19). Thus, fi /f2 € {2, 1/2}. 

Assume, for instance, that f; = 2f;. Write Ay = A, so that A; = 4A. Since both 
j(t%)) and j(t2) are dominant, we may choose 


_n@A)+ VIA =r) + VA 
nt ci a ae 


1= 


It follows that t2 = Sy(t1), where 


y= (( as € SL»(Z). 
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Hence the point P = (j(t;),j(t2)) belongs to the modular curve Yo(2) realized as 
the plane curve ®2(x;, x2) = 0, where 


@7(x1,X2) = — ches + x + x + 1488x7.x0 + 1488x1.x5 + 40773375x 1x2 
_ 1620007 _ 162000x5 + 8748000000x, + 8748000000x. 
— 157464000000000 


is the modular polynomial of level 2. Since deg ®) = 4 and P belongs to a straight 
line over Q, the coordinates of P generate a field of degree at most 4 over Q. Thus, 
3<h<4. 

Looking into existing class number tables (or using PARI) one finds that there 
exist only 5 negative discriminants A such that h(A) = h(4A) € {3, 4}: 


h=3: —23,-31; 
h=4: —7-37,-39,—55. 


Verifying that the polynomials Hy and Hy, for these values of A are not similar is 
an easy calculation with PARI. Oo 


Remark 5.3. We conclude the article with some computational remarks. 


1. As indicated in the introduction, it is very easy to list all CM-points satisfying (2) 
or (3); call them rational and quadratic, respectively. 

There exist exactly 13 discriminants A with h(A) = 1, the first line of 
Table | lists them all. Hence there exist 169 rational CM-points, and listing them 
explicitly is plainly straightforward. 

As for the quadratic CM-points, there are two kinds of them: points with 


A, =A, =A, h(A)=2, j(t) andj(t) are conjugate over Q, (22) 
and points with 


A, FA, h(A;) =h(A2) = 2, QG(t1)) = QG(e)). (23) 


There exist exactly 29 negative quadratic discriminants A with h(A) = 2, see the 
second line of Table | for the complete list. Hence there exist 29, up to conjugacy, 
quadratic CM-points satisfying (22). 

The quadratic CM-points satisfying (23) can be extracted from the “quadratic” 
part of Table 2. Indeed, if Q(t;) 4 Q(t), then Aj, Az are in Table 2 by 
Corollary 4.2. And if Q(t1) = Q(t), then A;/Az € {4, 1/4} by Proposition 4.3. 
Inspecting the list of the 29 discriminants with h(A) = 2, we find that the only 
possibility is {A,, A2} = {—15, —60}. Both these values appear in Table 2 in the 
line corresponding to the field Q(/5). 
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Looking into Table 2 we find that there exist 
44-1) +22-1)+ 99-1) 4+ 33-1) +2@Q-1) = 94 


(ordered) pairs (A, Az) as in (23). Each pair gives rise to two, up to conjugacy, 
points satisfying (23). So, up to conjugacy, there are 188 points satisfying (23), 
and 188 + 29 = 217 quadratic CM-points altogether. Again, listing them explic- 
itly is a straightforward computation. 

2. Thomas Scanlon (private communication) asked whether there exists a non- 
special straight line over C passing through more than 2 CM-points. Since 


ae 1728 —884736000 =i 
287496 —147197952000} 


the points (0,0), (1728, 287496) and (—884736000, —147197952000) belong 
to the same straight line, and so do the points (0,0), (1728, —884736000) and 
(287496, —147197952000). Notice that 


(44 


; =0, j(/V—1) = 1728, j(2V—1) = 287496, 


= —884736000, = —147197952000. 


3) 
¢ 5) 


FS) 
: 2 


We verified that (up to switching the variables x, , x) these are the only such lines 
defined over Q. Precisely: 


* no 3 rational CM-points, with the exceptions indicated above, lie on the same 
non-special line; 

* no line passing through conjugate quadratic CM-points contains a rational 
CM-point; 

¢ lines defined by pairs of conjugate quadratic CM-points are all pairwise 
distinct. 


The verification is a quick calculation with PARI. 
It is not clear to us whether the examples above (and absence of other 
examples) are just accidental, or admit some conceptual explanations. 
3. All the computations for this paper take about 20s. 
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1 Introduction 


In this section we set the stage for our results by quickly reviewing previous 
work on zeros of L-functions, leading up to -level correlations, densities, and the 
conjectured correspondence with random matrix ensembles. As this is a vast field 
and the readership of this book is likely to have diverse backgrounds and interests, 
we discuss in some detail the history of the subject in order to put the present 
problems in context. We concentrate on some of the key theorems and statistics, 
and refer the reader to the extensive literature for more information. After this 
quick tour we describe the Katz—Sarnak conjectures for the behavior of low-lying 
zeros, and then in Sect. 2 we state our new results for families of Maass forms (the 
reader familiar with this field can skip this section and go straight to Sect. 2). The 
analysis proceeds by using the Kuznetsov trace formula to convert sums over zeros 
to exponential sums over the primes. Similar sums have been extensively studied by 
Maier in many papers over the years (see, for example, [12, 36—40]); it is a pleasure 
to dedicate this chapter to him on the occasion of his 60th birthday. 


1.1 Zeros of L-Functions 


The Riemann zeta function ¢(s) is defined for Re(s) > 1 by 


oo —1 
C(s) =yie- I] (1-5) (1) 


s 
n=1 Pp prime e 


the Euler product expansion is equivalent to the Fundamental Theorem of Arith- 
metic on the unique factorization of integers into prime powers. Much can be 
gleaned in this regime. For example, looking at the limit as s — 1 from above 
shows the sum of the reciprocals of the primes diverges (and with just a little work 
one gets )* aa! /p ~ log log x), and hence there are infinitely many primes. The true 
utility of this function, however, doesn’t surface until we consider its meromorphic 
continuation &(s) to the entire complex plane, where 


E(s) = P(s/2)n G(s) = EU—5). (2) 


The product expansion shows &(s) has no zeros for Re(s) > 1, and from the 
functional equation the only zeros for Re(s) < 0 are at the negative even integers. 
The remaining zeros all have real part between 0 and 1; the Riemann Hypothesis 
[52] is the statement that these zeros all have real part 1/2. 

Ever since Riemann’s classic paper, researchers have exploited the connections 
between zeros of ¢(s) (and later other L-functions) to arithmetically important 
problems to translate information about the zeros to results in number theory. 


Maass Waveforms and Low-Lying Zeros 21 


For example, it can be shown that ¢(s) is never zero on the line Re(s) = 1. This 
implies the Prime Number Theorem: the number of primes at most x, 2 (x), is Li(x) 
plus a lower order term, where 


wy fa. 
ie 1 _ (3) 


excellent references for this and the subsequent results are [7, 26]. 

Similar results about primes in arithmetic progressions modulo m follow from 
analogous results about the distribution of zeros of Dirichlet L-functions L(s, 7) := 
>, X(n)/n', where x ranges over all primitive characters modulo m. It is worth 
noting that to study primes congruent to a modulo m it is not enough to study one 
specific Dirichlet L-function, but rather we need to understand the entire family 
coming from all characters of modulus m in order to invoke orthogonality relations 
to extract information about our progression from averages of (mm); this notion of 
family will be very important in our work later. 

After determining main terms, it is natural to ask about the form of the lower 
order terms. While the Riemann Hypothesis (RH) implies that m(x) = Li(x) + 
O(x'/? log x), neither it nor its generalization to other L-functions (GRH) is powerful 
enough to explain how the distribution of primes modulo m varies with the residue 
class, as these fluctuations are roughly the size of the errors from GRH. Chebyshev 
observed that there appeared to be more primes congruent to 3 modulo 4 than 
to 1 modulo 4. We now have an excellent theory (see [55]) that explains this 
phenomenon. A key ingredient is the Grand Simplicity Hypothesis, which asserts 
that the zeros of these L-functions are linearly independent over the rationals. 

Assuming RH, the non-trivial zeros of ¢(s) all have real part equal to 1/2, and 
may thus be ordered on the line. It therefore makes sense to talk about spacings 
between adjacent zeros 0; = 1/2 + iy;, or better yet spacings between adjacent 
normalized zeros (where we have normalized so that the average spacing is 1). 
Recent work has shown powerful connections between these gaps and important 
arithmetic quantities. For example, we can obtain excellent bounds on the size of 
the class groups of imaginary quadratic fields through knowing the existence of L- 
functions with multiple zeros at the central point [18, 19], or knowing that a positive 
percentage of gaps between normalized zeros of the Riemann zeta function are at 
least a certain fixed fraction of the average spacing [6]. 

The central theme in the above examples is that the more information we know 
about the zeros of the L-functions, the more we can say about arithmetically 
important questions. We started with just knowledge of the zeros on the line 
Re(s) = 1, and then extended to GRH and all non-trivial zeros having real part 
1/2, and then went beyond that to the distribution of the zeros on that critical line. 
In this chapter we expand on this last theme, and explore the distribution of zeros of 
L-functions on the critical line. 
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1.2. n-Level Correlations and Random Matrix Theory 


While zeros of L-functions is a rich subject with an extensive history, our story on 
the number theory side begins in the 1970s with Montgomery’s [47] work on the 
pair correlation of the zeros of ¢(s). Given an increasing sequence of numbers {a} 
and B Cc R"! a compact box, the n-level correlation R,,(B) is defined by 


{(a%j, — A, ae Oj, =0j,) € B, Ji <N} 
N ’ 


R,(B) := lim : (4) 
Noo 

where the indices above are distinct. Instead of using a box (which is equivalent 

to a sharp cut-off) it’s often technically easier to consider a similar version with a 

smooth test function (see [56]). 

While knowing all the n-level correlations allows one to determine all the 
neighbor spacings (see, for example, [41]), computing these for arbitrary B (or for 
any admissible test function) is well beyond current technology. There are, however, 
many important partial results. The first is the referred-to one of Montgomery [47], 
who showed that for suitable test functions the 2-level density agrees with the 2-level 
density of eigenvalues of the Gaussian Unitary Ensemble (GUE). There are many 
ways to view this ensemble of matrices. The easiest is that these are Hermitian 
matrices whose upper triangular entries are independently drawn from Gaussians 
(as the diagonal must be real, we draw from a different Gaussian for these entries 
than we do for the non-diagonal ones). An alternative definition, which explains 
the use of the word unitary, deals with the equality of the probability of choosing 
a matrix and its conjugation by a unitary matrix; note this is equivalent to saying 
the probability of a matrix is independent of the base used to write it down. From 
a physical point of view these matrices represent the Hamiltonian of a system. 
What matters are their eigenvalues, which correspond to the energy levels. While 
the entries of the matrix change depending on the basis used to write it down, the 
eigenvalues do not, which leads us to the unitary invariance condition. These and 
other matrix families had been extensively studied by Dyson, Mehta, and Wigner 
among many others; see [5, 14, 15, 21,41, 46] and the multitude of references therein 
for more on the history and development of the subject. 

This suggested a powerful connection between number theory and random 
matrix theory, which was further supported by Odlyzko’s numerical investigations 
[48, 49] showing agreement between the spacings of zeros of ¢(s) and the eigen- 
values of the GUE. Subsequent work by Hejhal [22] on the triple correlation of 
¢(s) and Rudnick and Sarnak [56] on the n-level correlations for all automorphic 
cuspidal L-functions, again under suitable restrictions, provided additional support 
for the conjectured agreement between the limiting behavior of zeros of L-functions 
(as we move up the critical line) and eigenvalues of N x N matrices (as N > oo). 

These results indicated a remarkable universality in behavior; while there are 
many random matrix ensembles, it appeared that only one was needed for number 
theory. A little thought, however, shows that this might not be the full story. 
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The reason is that the n-level correlations are insensitive to the behavior of finitely 
many zeros. In other words, we can remove finitely many zeros and not change 
R,(B). This is particularly troublesome, as there are many problems where only a 
few zeros matter. For example, the Birch and Swinnerton-Dyer conjecture [3, 4] 
states that the order of vanishing of the Z-function associated with an elliptic curve 
equals the rank of its Mordell-Weil group; thus in studies on this problem we only 
care about what is happening at the central point, and not at all about what is 
happening far away on the critical line. 

Later studies by Katz and Sarnak [28, 29] confirmed that more care is needed. 
The n-level correlations of the zeros of L-functions agree not only with those 
from the GUE, but also with those coming from the classical compact groups. 
The advantage of the latter is that the probability distribution a matrix is chosen 
is derived from the Haar measure on the group, which is a canonical choice. 
This is in sharp contrast to the definition of the GUE, where we fix a probability 
distribution and choose independent entries from it, which begs the question why 
one distribution was chosen over another (for the GUE, the answer is that the 
Gaussian is forced upon us by our assumption of the probability being invariant 
under unitary transformations of the basis). They proved that as N — oo the n-level 
correlations of the eigenvalues are the same for all the classical compact groups 
(unitary, symplectic, and orthogonal, split or not split by sign). Thus one could just 
as easily say that the zeros of €(s) behave like the eigenvalues of orthogonal matrices 
instead of the GUE. 

This led Katz and Sarnak to introduce a new statistic that is both able to 
distinguish the different classical compact groups and which depends on the 
behavior of eigenvalues near 1. We briefly describe the comparisons between 
number theory and random matrix theory. If we assume the Riemann hypothesis 
then the non-trivial zeros have real part 1/2 and we may write them as 9; = 1/2+iy; 
for y; real. On the random matrix theory side, the classical compact groups are 
unitary matrices, and we can therefore write their eigenvalues as e'* with 6; real. 
From intuition gleaned from earlier results, as well as function field analogues, Katz 
and Sarnak were led to conjecture that in the appropriate limits the behavior of zeros 
near 1/2 agree with the behavior of eigenvalues near | (more generally, one can also 
compare values of Z-functions and characteristic polynomials of matrices). 


1.3 n-Level Densities and the Katz—Sarnak Philosophy 


Unfortunately, it is not possible to compare just the zeros of one L-function near 
the central point to the eigenvalues of one matrix. As in many problems in analytic 
number theory, we need to be able to execute some type of averaging and take some 
kind of limit in order to isolate out a main term and make progress. For the n-level 
correlations (or, equivalently, for Odlyzko’s work on spacings between adjacent 
zeros), one L-function provides infinitely many zeros, and the average spacing 
between critical zeros at height T is on the order of 1/ log T. Thus if we go high up, 
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we are essentially averaging over the zeros of that L-function, and can isolate out 
a universal, main term behavior. If instead we concentrate on the low-lying zeros, 
those near the central point, the situation is very different; due to the symmetry of 
the zeros about the central point we may restrict to studying the zeros in the upper 
half plane (this is why, in the definitions below, we use even test functions). To each 
L-function L(s,f) we can associate a quantity, called the analytic conductor cy, such 
that the first few zeros are of height 1/ log cy. If we rescale so that these zeros are of 
mean spacing one, then given any constant C there are essentially a finite number 
(depending basically just on C) that are at most C. 

In order to make progress we need to collect a large number of L-functions which 
should behave similarly and are naturally connected. We call such a collection a 
family of L-functions. The definition of what is a family is still a work in progress 
(see [9] among others), but most natural collections of Z-functions are. Examples 
include families of Dirichlet characters (either all of a given conductor, all whose 
conductor is in a given range, say [N,2N], or just quadratic characters whose 
conductor is in a range), cuspidal newforms (and very important subsets, one or two 
parameter families of elliptic curves), symmetric powers of cusp forms, and so on. 
Collections that are not families would include arbitrary subsets, for example, cusp 
forms whose third Fourier coefficient is 2 modulo 5, or cusp forms whose first zero 
above the central point is at least twice the average. Typically as the conductors (or 
range) grows we have more and more L-functions in the family. The Katz—Sarnak 
philosophy is that if we take averages of statistics of zeros over the family then in 
the limit it will converge and agree with the corresponding statistic for the scaling 
limit of a classical compact group as the matrix size tends to infinity. 

The main statistic we study in this paper is their n-level density. For convenience 
of exposition we assume the Generalized Riemann Hypothesis for L(s,f) (and thus 
all the zeros are of the form 1/2 + iy;-, with y;-f real), though the statistic below 
makes sense even if GRH fails. Let ¢(x) = TTj=1 j(x;) where each ¢; is an even 
Schwartz function such that the Fourier transforms 


lo) 
Ho) = fee rrrar (5) 
—oo 
are compactly supported. The n-level density for f with test function ¢ is 


Daf.) = Y> bs (Levis) ++ On Ly Vint) + (6) 

ein 
where Ly is a scaling parameter which is frequently related to the conductor. The 
idea is to average over similar f, and use the explicit formula to relate this sum over 
zeros to a sum over the Fourier coefficients of the L-functions. See, for example, 
[27], the seminal paper in the subject and the first to explore these questions, and 
see [56] for a nice derivation of the explicit formula for general automorphic forms. 
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The subject is significantly harder if the conductors vary in the family, as then 
we cannot just pass the averaging over the forms through the test function to the 
Fourier coefficients of the Z-functions. If we only care about the 1-level density, 
then we may rescale the zeros by using the average log-conductor instead of the 
log-conductor; as this is the primary object of study below we shall rescale all the 
L-functions in a family by the same quantity, which we denote R, and we emphasize 
this fact by writing D,(f,@,R). For more on these technical issues, see [42, 43], 
which studies families of elliptic curves where the variation in conductors must 
be treated. There it is shown that if the conductors vary within a family then this 
global renormalization leads to problems, and in the 2-level computations terms 
emerge where we cannot just pass the averaging through the test function. In some 
problems, however, it is important to compute the n-level densities. One application 
is to obtain significantly better bounds on the order of vanishing at the central point 
(see [23]). Another is to distinguish orthogonal candidates if the 1-level density can 
only be computed for small support; we will elaborate on this below. 

Given a family ¥ = Uy-¥y of L-functions with conductors tending to infinity, 
the n-level density D,(.F, ¢, R; w) with test function ¢, scaling R and a non-negative 
weight function w is defined by 


Oa hae ace 43 Vere gy WADalf, $, Rs w) 
D,(F, 9, RB: w) = jim, ae mr . 


(7) 


The advantage of this statistic is that for a fixed f, individual zeros of L(s,f) can 
contribute, with most of the contribution coming from the zeros near the central 
point due to the rapid decay of the test functions. Further, as we are averaging over 
similar forms there is a hope that there is a nice limiting behavior. In our applications 
later we will have weights wr(t) = w(t/T) with T a parameter tending to infinity, 
but we suppress the subscript 7 as it is always understood. 

Katz and Sarnak [28, 29] proved that the n-level density is different for each 
classical compact group, and found nice determinant expansions for them. Set 
K(y) := oar and K. (x,y) := K(x—y)+¢€K(x+y) fore = 0, +1. They proved that 
if Gy is either the family of N x N unitary, symplectic or orthogonal families (split 
or not split by sign), the n-level density for the eigenvalues converges as N —> oo to 


foe fer. id Wrcl. and dy 


= [oe [G01 nitncOn syndy din (8) 
where 


Win, SO(even) (x) = det(Ky (xi, Xj) ii<m 


II 


det(K—1 (xj, Xj) )ij<m + a 5 (xg) det(K_1 (x1, %) i j-AR 
k=l 


Wm, SO(odd) (x) 
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1 1 
Wn,o0(x) = Wn, SO(even) (x) + <Wn, SO(odd) (x) 


2 2 
Wu) = det(Ko (xj. X)ij<m 
WinSp (x) = det(K_, (xj, Xi) )iji<m- (9) 


While these densities are all different, for the 1-level density with test functions 
whose Fourier transforms are supported in (—1,1) the three orthogonal flavors 
cannot be distinguished from each other, though they can be distinguished from the 
unitary and symplectic densities. Explicitly, the Fourier Transforms for the 1-level 
densities are 


4 1 
Wi, SO(even) (U) = 59 (u) + zn) 


~ 1 
Wi o(u) = do(u) + 5 
i 1 
Wi, sovoaa(u) = 5o(u) — 77) +1 


z 1 
Wi sp(u) = do(u) — 37) 
Wi uw) = bo(u), (10) 


where n(u) is 1, 1/2, and 0 for |u| less than 1, equal to 1, and greater than 1, 
respectively, and do is the standard Dirac Delta functional. Note that the first three 
densities agree for |u| < 1 and split (i.e., become distinguishable) for |u| > 1. Thus 
in order to uniquely specify a symmetry type among the three orthogonal candidates, 
one either needs to obtain results for support exceeding (—1, 1), or compute the 2- 
level density, as that is different for the three orthogonal groups for arbitrarily small 
support [42, 43]. 

The Katz—Sarnak Density Conjecture states that the behavior of zeros near the 
central point in a family of L-functions (as the conductors tend to infinity) agrees 
with the behavior of eigenvalues near 1 of a classical compact group (as the 
matrix size tends to infinity). For suitable test functions, this has been verified in 
many families, including Dirichlet characters, elliptic curves, cuspidal newforms, 
symmetric powers of GL(2) L-functions, and certain families of GL(4) and GL(6) 
L-functions; see, for example, [8, 9, 11, 13, 16, 17, 20, 23, 24, 27, 29, 32, 43, 45, 50, 
51,53, 54, 59, 60]. This correspondence between zeros and eigenvalues allows us, at 
least conjecturally, to assign a definite symmetry type to each family of L-functions 
(see [9, 57] for more on identifying the symmetry type of a family). 

For this work, the most important families studied to date are holomorphic cusp 
forms. Using the Petersson formula (and a delicate analysis of the exponential sums 
arising from the Bessel—Kloosterman term), Iwaniec et al. [27] proved that the 
limiting behavior of the zeros near the central point of holomorphic cusp forms 
agrees with that of the eigenvalues of orthogonal matrices for suitably restricted test 
functions. In this chapter we look at the other GL, /Q family of L-functions, Maass 
waveforms. 
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2 Statement of Main Results 


We first describe the needed normalizations and notation for our families of Maass 
forms, and then conclude by stating our new results and sketching the arguments. 
The beginning of the proofs is similar to that in all families studied to date: one uses 
the explicit formula to convert sums over zeros to sums over the Fourier coefficients 
of the forms. The difficulty is averaging over the family. In order to obtain support 
beyond (—1, 1), we have to handle some very delicate exponential sums; these arise 
from the Bessel—Kloosterman term in the Kuznetsov trace formula. To facilitate 
applying it, we spend a lot of time choosing tractable weights. This is similar to 
previous work on cuspidal newforms where the harmonic weights were used to 
simplify the application of the Petersson trace formula. It is possible to remove 
these weights, and this is done in [27]. For some applications it is important to 
have unweighted families, in order to talk about the percentage of forms that vanish 
at the central point to a given order (see [23]); for our purposes, we are primarily 
interested in obtaining large enough support to uniquely determine the symmetry 
type, and thus choose our weight functions accordingly. 


2.1 Normalizations and Notation 


We quickly recall the basic properties of Maass forms (see [25, 26, 34, 35] for 
details), and then review the 1-level density from the last section with an emphasis 
on the important aspects for the subsequent computations. We use the standard 
conventions. Specifically, by A < B we mean |A| < c|B| for a positive constant 
c. Similarly, A = B means A < Band A > B. We set e(x) := exp(2zix), and 
define the Fourier transform of f by 


7® := [ Flxje(—xé)de. (11) 


In the rest of the paper u always denotes a Hecke—Maass cusp form on Ip(N) 
with N square-free. Thus uv is an eigenfunction of the Laplacian with eigenvalue 
ee (5 + ity) (5 — it,), and it is either even or odd with respect to the involution 
zh —1/z; if wis even we set € = 0, otherwise we take « = 1. Selberg’s 3/16 ths 
theorem implies that we may take t, > 0 or t, € [0, sli. Next we Fourier expand u 
as follows: 


uz) = y? S° an(u)K.-1/2(27|nly)e(ny). (12) 
n#0 
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Let 
ay(u) 
An = ——_., 13 
) cosh(zt,)!/2 (13) 
where the K,(z) are the K-Bessel functions. We normalize u so that A;(u) = 1. 
The L-function associated with u is 
L(s,u) := >, Ann. (14) 


n>1 


By results from Rankin—Selberg theory the L-function is absolutely convergent in 
the right half-plane ‘¥(s) > 1 (one could also use the work of Kim and Sarnak 
[30, 31] to obtain absolutely convergent in the right half-plane R(s) > 71/64, 
which suffices for our purposes). These L-functions analytically continue to entire 
functions of the complex plane, satisfying the functional equation 


A(s,u) = (-1)£ AQ —s,w), (15) 
with 
AG = aT (A) (Fe) L(s, 4). (16) 
Factoring 
1—A,X + X? =: (1—a)X)(1 — BpX) (17) 


at each prime (the a,, £, are the Satake parameters at p not dividing the level N; if 
p divides the level, then either a, or B, is zero), we get an Euler product 


L(s,u) = |] —app*)'0 - Bp), (18) 
Pp 


which again converges for ‘i(s) sufficiently large. 

For the remainder of the paper Ay denotes an orthogonal basis of Maass cusp 
forms on I9(N), all normalized so that A; = 1; thus, Ay is not orthonormal under 
the Petersson inner product on the space. Note we do not take a basis of newforms— 
that is, the delicate sieving out of oldforms as in [27] is not done. Using Weyl’s law, 
however, one can control the sieving (see [25]). 

We use the notation Avg(A; w) to denote the average of A over By with each 
element u € By given weight w(u). That is, 


Dea A(u)w(u) 


Avg(A;w) := Sew 


(19) 
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Our main statistic for studying the low-lying zeros (i.e., the zeros near the central 
point) is the 1-level density; we quickly summarize the needed definitions and facts 
from Sect. 1.3. Let @ be an even Schwartz function such that the Fourier transform 
d of ¢ has compact support; that is, 


og a p(xe 2 dx (20) 


and there is an 7 < oo such that é(y) = 0 for y outside (—n, 7). 
The 1-level density of the zeros of L(s, u) is 


l 
Di (u,o.R) = De( ery). (21) 
Q 


where 9 = 1/2 + iy are the non-trivial zeros of L(s, u), and log R is a rescaling 
parameter related to the average log-conductor in the weighted family, whose choice 
is forced upon us by (46). Under GRH all y are real and the zeros can be ordered; 
while GRH gives a nice interpretation to the 1-level density, it is not needed 
for our purposes. As @ is a Schwartz function, most of the contribution comes 
from the zeros near the central point s = 1/2. The different classical compact 
groups (unitary, symplectic, and orthogonal) have distinguishable 1-level densities 
for arbitrarily small support; however, the 1|-level densities for the even and odd 
orthogonal matrix ensembles are equal for test functions whose Fourier transforms 
are supported in (—1, 1). There are two solutions to this issue. One possibility is to 
perform a more detailed analysis and “extend support.” The other is to study the 2- 
level density, which Miller [42, 43] showed distinguishes the orthogonal ensembles 
for arbitrarily small support. For some of the families studied below we are able 
to calculate the support beyond (—1, 1), and we may thus determine which of the 
orthogonal groups should be the symmetry group; for the other families our support 
is too limited and we instead study the 2-level density. 


2.2. Main Results 


Similar to how the harmonic weights facilitate applications of the Petersson formula 
to average the Fourier coefficients of cuspidal newforms (see, for instance, [27, 44]), 
we introduce nice, even weight functions to smooth the sum over the Maass forms. 
As we will see below, some type of weighting is necessary in order to restrict to 
conductors of comparable size. While our choice does not include the characteristic 
function of [T,2T], we are able to localize for the most part to conductors near T, 
and are able to exploit smoothness properties of the weight function in applications 
of the Kuznetsov trace formula. Further, in problems such as these the primary goal 
is to have as large support as possible for the Fourier transform of the test function 
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that hits the zeros. For more on these issues see [2], where Alpoge and Miller impose 
even more restrictions on the weight functions, which allows them to increase the 
support. 

We consider the averaged one-level density weighted by two different weight 
functions of “nice” analytic properties. Let H € C® ((—4. ¢)) be an even smooth 
bump function of compact support on the real line (thus it should be close to 1 in 
some region and decay very quickly to zero outside there), and let H be its Fourier 
transform. We may of course (by applying this construction to a square root—recall 
that the support of a convolution is easily controlled) take H > 0. We may also take 


H to have an order K zero at 0. Let 
r 
Hy(r) i= Hi) (22) 


This is essentially supported in a band of length = T about +7. 

Next, in the same way, let hecm ((—4. ‘)) be even. We also require h to have 
an order at least 8 zero at 0. Note that, by the same process as above, we may take 
h(x) = 0 for all x € R and also (by Schwarz reflection) h(ix) = 0 for all x € R. Let 


T be a positive odd integer. We let 


h 
hr(r) := : (23) 


This is the same test function used in [2], and is essentially supported in a band of 
length =< T about +7. 
By trivially bounding the Fourier integral we observe that 


A(x + iy), h@+iy) « exp (>) : (24) 
Hence 
Hr(ir) < exp (=) ; (25) 
and, using sinh(x) >> e!*!, we find 
hr(r) < exp (-2) , (26) 
AT 


These will both be useful in what follows. 

In one-level calculations we will take an even Schwartz function ¢ such that b is 
supported inside [—7, 7]. We suppress the dependence of constants on h, H, ¢, and 
7 (as these are all fixed), but not T or the level N since one or both of these will be 
tending to infinity. 
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We weight each element u € Ay by either Hr (t,)/||u||? or hr(t,)/||ul|?, where 


1 dxdy 


[lel|? = Well; ~ [SL2(Z) : To(N)] oe 
TMD [SLA(Z): MO) Inoy\o 


(27) 
is the L?-norm of u on the modular curve Yo(N), and, as before, A, = 7 + - is the 
Laplace eigenvalue of u. Recall that 


[SLo(Z) : oN] =: vw) = [][@+ dD. (28) 


p|N 


The averaged weighted one-level density may thus be written (we will see in (46) 
that R =< T°N is forced) 


D (By. o,. Rw) := Avg (Diu, $, R); w(tu)/| lu?) . (29) 


where w(t,) is either Hr(t,) or hr(t,). 

The main question is to determine the behavior of D, (Ay, d, R; w) as either the 
level N or the weight parameter T tends to infinity; specifically, one is generally 
interested in the corresponding symmetry group. There are now several works [9, 
28, 29, 57] which suggest ways to determine the symmetry group. For our family, 
they suggest the following conjecture. 


Conjecture 2.1. Let hr be as in (23), @ an even Schwartz function with b of 
compact support, and R = TN. Then 


lim D; (Ay, ¢,Riw) = i $(t) Wi o(f)dt, (30) 
R>oo R 
where Wi 9 := 1+ 550. In other words, the symmetry group associated with the 


family of Maass cusp forms of level N is orthogonal. 


In [2] the above conjecture is shown for N = 1,w = Ar, with the extra 
restrictions that h has 2K > 8 zeros at the origin and supp(¢) C (—2 + eae 2 


sen): Our first result here is in the case where the level N tends to infinity 
(remember that N must be square-free), T and K are fixed, and w = wy equals 
either h7 or Hr. 


Theorem 2.2. Fix T and K and let R = N with N square-free. Let H be an 
even, non-negative function with K > 8 zeros at 0 and Fourier transform He 
Cm ((—7. +)), and let h be an even function with 8 zeros atOandh € C® ((—4. +). 
Let the weights w = wr be either Hr or hr, where these are the functions 
given by (22) and (23), respectively. Let @ be an even Schwartz function with 


supp($) © (—3, 3). Then 
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fim, Di@y.d.Rewn) = [ oOMol0de G1) 


N_ square-free 


Notice that the support in Theorem 2.2 exceeds (—1,1), and thus we have 
uniquely specified which orthogonal group is the symmetry group of the family. 

Next we investigate the case where N is fixed and T tends to infinity through odd 
values. For ease of exposition we take N = 1. 


Theorem 2.3. Let h be an even function with 8 zeros at 0 and he ce ((—4. +)), 


and define hr as in (23). Let @ be an even Schwartz function with supp(@) € (-1,)), 
and take R x T? (here N = 1). Then 


lim, Dy(Ai.$-Rihr) = [ $()W, o(adt. (32) 


T odd 


We also get a similar, though slightly weaker, result for the weight function w = 
Hr if we allow K = ord,<o H(z) to vary. For the argument given we invoke the work 
of [27] twice, since we reduce the Bessel—Kloosterman term of the Kuznetsov trace 
formula to sum of Kloosterman terms arising in the Petersson trace formula. Since 
[27] use GRH (specifically, for Dirichlet L-functions and L-functions associated 
with symmetric squares of holomorphic cusp forms), we must, too. 


Theorem 2.4. Assume GRH for Dirichlet L-functions and symmetric squares of 
holomorphic cusp forms of level 1. Let H be an even, non-negative function with K 
zeros at O and Fourier transform Hece ((-4, +)), and let the weights w = wr 
be Hr, which is given by (22). Let @ be an even Schwartz function with supp(?) Cc 
(-I+ are 1- sR): Take R x T* (here N = 1). Then 


jim Di(A1, $, R Hr) = [ somo (33) 


Notice the support in Theorems 2.3 and 2.4 is too small to uniquely determine 
which orthogonal symmetry is present (this is because the one-level densities of the 
orthogonal flavors all agree inside (—1, 1)). At the cost of more technical arguments, 
Alpoge and Miller [2] are able to extend the support beyond (—1, 1) when weighting 
by hr, thereby determining the symmetry group to be orthogonal. In this work 
we instead compute the 2-level density, which provides a second proof that the 
symmetry type of the family of Maass cusp forms on SL2(Z) is orthogonal. The 
2-level density is defined in (36). As any support for the 2-level density suffices to 
uniquely determine the symmetry group, we do not worry about obtaining optimal 
results. 


Theorem 2.5. Let wr equal hy or Hy, R X T? (here N = 1), and let 


1 ; 
MY) ye (34) 


nal 2 
Dee Tuuli2 “S41 ||] 
(-1)&=-1 
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be the weighted percentage of Maass forms in &, with odd functional equation. 
Write 


Dx(A, 91,62, Ri wr) += Avg (D2(u, $1, 62, R): wr(tu)/||ull’) . (35) 


with 


D2(u, $12.8) = Yoo (Sy 1) (Sy ') 


ify 
= Dy,(u, $1, R)D\(u, 2, R) 
— 2D, (u, $12, R) 
+ 6¢1¢1(0)2(0), (36) 


the average 2-level density of the weighted family of level 1 Maass cusp forms, and 
the 2-level density of u € Ay, respectively. Let f * g denote the convolution of f 
and g. Then, for 6 « 1 and ¢,, $2 even Schwartz functions supported in (—6, 6), 


$2(0) 
2 


jim Dt Ai.dr.da. Rwy = (22 + G0) (2° + 0) 


42 i 1b x) Pala) 


— (1 —.V(-1))g1(0)G2(0) — 21 * 62) (0), 
(37) 


agreeing with the 2-level density of the scaling limit of an orthogonal ensemble with 
proportions of NV (—1) SO(odd) matrices and 1 — WV (—1) SO(even) matrices. 


A similar result holds for 4y—all the calculations will be standard given our work 
on the one-level densities. 


2.3. Outline of Arguments 


By a routine application of the explicit formula we immediately reduce the problem 
to studying averages of Hecke eigenvalues over the space of Maass cusp forms of 
level NV. For this we apply the Kuznetsov trace formula, as found in [33]. We are 
quickly reduced to studying a term of shape 


vin) S(m, ae fs (=) rwr(r) a (38) 


a cosh(zr) 
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In all cases the idea is to move the contour from R to R — iY with Y — oo. The 
properties of the weights hr or Hy ensure that the integral along the moving line 
vanishes in the limit, so all that is left in place of the integral is the sum over poles, 
of shape (up to negligible error in the case of hr, which also has poles of its own) 


Deane (A) t+ Dwr (AY), G9) 


‘N 
k>0 . 


Now the N — oo limit is very easy to take, as all the Bessel functions involved 
have zeros at 0. So the term does not contribute (the total mass is of order T?v(N), 
canceling the v(N) out in front). With some care we arrive at Theorem 2.2. 

If instead we take N = 1 and wr = Hr, then, by standard bounds on Bessel 
functions, Jox41 ea) is very small for k larger than = = For us ./m will always 
be bounded in size by something that is =x 7”. Thus for k smaller than this range, 
the Bessel term is still controlled but not too small. It is the term 


2k+1 2k+1 EY" 
——i]) x~aA j — 40 
vr( 2 ') ( or ') > (z) oa 


that is small. Upon taking 


K > = (41) 
In 

it is in fact small enough to bound trivially. For slightly larger support we instead 
appeal to the bounds of [27] on sums of Kloosterman sums, which are derived 
from assuming GRH for Dirichlet Z-functions. This gives Theorem 2.4. In fact, 
the expression we get is exactly a weighted sum of terms appearing in [27] from 
the Kloosterman terms of Petersson formulas. It would be interesting to find a 
conceptual explanation for this. 

The proof of Theorem 2.3 is a simplified version of the argument given in [2], 
except considerably shortened—instead of delicate analysis of exponential sums, 
we just use Euler—Maclaurin summation. As one would expect our support is thus 
smaller than that in [2], but the argument and main ideas are significantly easier 
to see. 

The proof of Theorem 2.5 follows from the previous results and another 
application of the Kuznetsov formula, this time to the inner product of T,. with 
Tv with p, q primes. 


3 Preliminaries for the Proofs 


In this section we compute and analyze some expansions and resulting expressions 
that are useful in the proofs of our main theorems. We start in Sect. 3.1 by using the 
explicit formula to relate the sum over zeros to sums over the Hecke eigenvalues of 
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the associated cusp forms. The weights and normalizations are chosen to facilitate 
applying the Kuznetsov trace formula to these sums, which we do. After trivially 
handling several of the resulting terms, in Sect. 3.2 we analyze the Bessel function 
integral that arises. We then use these results in Sect. 4 to prove the stated theorems. 


3.1 Calculating the Averaged One-Level Density 


We first quickly review the computation of the explicit formula; see [27, 56] for 
details. Let u € Ay, and for an even Schwartz test function ¢ set 


1 
s—;)logR 
(5) 1= (am. (42) 
20 
Consider 
A’ 
P(s) —(s, u)ds. (43) 
o=3 A 
By moving the integration too = —} and applying the functional equation, we find 
that 
A’ 
2 | - (s,u)P(s)ds = D(u, d) (44) 
3 
o> 


(use the rapid decay of ¢ along horizontal lines and Phragmen—Lindel6of to justify 
the shift). After expanding the logarithmic derivative out in the usual way, applying 
the Kim-Sarnak bound, and noticing that 4,2 = r; — xo(p) for Zo the principal 
character modulo N, this equality simplifies to 


2 
pia = 20) 4 0) (EN TORR) 4 ( SEtEA + Melee’) 
ogR logR 


»¢ log p » 5(Te) 
+2 : . (45) 
yy eee logR 


Thus, if wr is essentially supported on = T (as are hy and H7), the averaged 
one-level density is (since ||u|| = 1 under our normalizations, by Smith [58]) 


g log(7°N log log R + log log N 
D,(Bv.¢,Riwr) = oO G10) (EE) + 0 glog R + loglog ) 
R logR 
£292 PEP 5 (TEP) mvaQycwn. 6 
VE(A,05 W 
(=| p pi logk” log R — 


Notice the above computation tells us the correct scaling to use is R x T*N. 
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The difficulty is in determining the averages over Hecke eigenvalues. For this we 
use the Kuznetsov formula for Ay. Let wr equal hr or Hr. 


Theorem 3.3 (Kuznetsov Trace Formula (See [33], p. 86)). Let m € Z*. Then 


yo ye ID fer tantra 


pa ae 


dr 


7 [ 22 DEE. Onin 
a IG IPG + NP 


a. 
(ip)piw EXO, Leo) 


yee 7 (=) rwr(7r) ae 


cosh(zr) 


cl 


(47) 


where S is the usual Kloosterman sum, (ip) |v runs through all 0 < ip < 1 with p 
ranging over the prime factors of N, 


Gir(a, (ip)) = 
—1-2ir - 
IT ip = aS Hpi P ") a e| - af 
P dir d TI Iv pi’ ? 
pP\|N dla fe(Z/ [pw '?Z) = P 
(48) 
J is the usual Bessel function, and 
1-i 
. p N 
GIP = = = (49) 


p|N LP v) Taw p? 


In our applications we always have (m, N) = 1 since we only take m = 1, p, or 
p*, which means that the contribution from the principal character in the definition 
of 6;, may be ignored. Also the inner sum in 6;,(a, (ip)) is of the form 


> (=) = p(n) < 1. (50) 
ge@jnzyx \" 
Hence, bounding trivially and noting that our m have at most three divisors, we find 


-1 


Birla, (in) < {[[p?] - (51) 


pIN 
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Also, by work of de la Vallée Poussin on the prime number theorem, ¢(1 + 2ir) > 
log(2 + |r|)~!. Hence the second term in (47), the Eisenstein contribution, is 


v(N 1 
ee UN) y [wr togt2 + lar 
(ip)pin T1wP” 


v(N)T log T 1 
co ss) 


pIN 


= (yr T (52) 
= 7 og T. 


In our applications we will always divide these expressions by the corresponding 
expression with m = 1, which gives the total mass of the family (“the denominator” 
in the sequel). We will see that it is of order =x T*v(N) (see Corollary 3.4). Hence, 
since it will be divided by something of order x T?v(N), the Eisenstein contribution 
is thus negligible for N or T large. 

Note that the diagonal term (that is, the first term of (47), with m = 1) is 


vn) f nent) tanh(ar)dr < T?v(N). (53) 


Hence to show the claim about the total mass it suffices to bound the last term of (47) 
in the case of m = 1. 

We have therefore reduced the computation of the weighted 1-level density to 
understanding the “Bessel-Kloosterman” terms. We isolate this result below. 


Lemma 3.2. [fm = 1, p or p? is coprime to N and wr equals hr or Hr, then 


py Amu) Wr(ty) = Sm. : (= TN) 


ean Mr 


+ SOO ye ee fa (=) rwr(r) ar 


= cosh(zr) 


(54) 


where by ° (= T’N) is the product of a term on the order of T?N with Kronecker’s 
delta. 
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3.2. Handling the Bessel Integral 


As in [2], the technical heart of the analysis of the Kuznetsov formula is the 
following claim, which relies on the analytic properties of hr and Hr; see Appendix 
1 for a proof. 


Proposition 3.3. Let T be an odd integer. Let X < T. Let wr equal hy or Hr, where 
these are the weight functions from Theorems 2.2 through 2.5. Then 


rwr(r) ine 
[00 SO ar = ¢ SoH Dog (X) (2k + l)wr ((«+ 5) ) 


k>0 


$eaT? Yh (-1)oer EA) (55) 
k>1 
=¢ S\(- Doe (X) (2k + 1)wr (i + 5) ) 
k>0 
[+0 (Xe°")], (56) 


where C,, C2, and c3 are constants independent of X and T, and the terms in brackets 
are included if and only if wr = hr. 


Since Bessel functions of integer order are much better-studied objects than those 
of purely imaginary order, this is a useful reduction. The calculation also realizes the 
Kloosterman term in the Kuznetsov formula as a sort of average (though the “weight 
function” is growing exponentially in the case of H;) of Kloosterman terms arising 
in Petersson formulas over all even weights. 

For what follows we quickly note the following corollary, which determines the 
size of the denominator (total mass) mentioned above. 


Corollary 3.4. Let wr equal hy or Hy (as above). Then 


y- wrt) py(Ny, (57) 


2 
ZS Tall 


Proof. It suffices to show that 


“ > Bus : No SoD oes (=) (2k + 1)wr ((«+ 5) ) < as 


c>1 k>0 


To see this, we bound trivially by using 


(x/2)* 


Ji (x) <K kl ’ 


(59) 
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and 


(+1 - 
S T 
sin ( as n) > (60) 


in the case of wr = hr. 


4 Proofs of the Main Theorems 


Using the results from the previous section, we can now prove our main theorems. 
All arguments begin with the following reductions. We first use (46) to reduce the 
determination of the 1-level density to that of sums of the weighted averages of A, 
and Ag. We then use the Kuznetsov trace formula (Theorem 3.3) to analyze these 
sums. By Lemma 3.2 we are reduced to bounding the contribution from the Bessel— 
Kloosterman term, to which we apply Proposition 3.3 to analyze these exponential 
sums. We now turn to the details of each of these cases. 


4.1 Proof of Theorem 2.2 


It suffices to study 


2 

2logp ~ (£logp\ v(N) S(p‘, 1; Ne) 

oi yy (ey 
t=1 p p2logR og c 


c>l 


Se Dida (2 =) ok + nr (#4), (61) 


k>0 


and bound .Y by something growing strictly slower than v(NV). This is because we 

get to divide this term by the total mass, which by Corollary 3.4 is of the order 

T°v(N). As T is fixed, we are dividing by a quantity on the order of v(N). 
Bounding trivially, we find 


? _ See (270)! wp (Si) ¥ 3 rd CN) Fte-2k-1 yk 


Qk! 
cel k>0 £=1 p’/2<Rn/2 


v(N) (Qx)* QA+1L\ REED 
nix Ger 2 NlogR 


< 


3 
n—st+e72n 
N 2 T oO (TN) ; 


< v(N)——___ 
»( Nog + logT 


(62) 


As T is fixed, the above is negligible for 7 < 3/2, which completes the proof. O 


40 L. Alpoge et al. 
4.2 Proof of Theorem 2.3 


It suffices to study 


2 
logp » ( llogp S(p*, 1;c) 
F = Dy ee g (Teeny ee 
t=1 Pp p2 logT og c 


c>l1 


OED ett (=) (2k + Ihr (=F) (63) 


k>0 


and bound .Y by something growing strictly slower than T*. This is because N is 
fixed, so by Corollary 3.4 the denominator that occurs in the weighted averages is 
on the order of T?. 

In [2] (see their (43)-(46)—for the convenience of the reader, this argument is 
reproduced in (Appendix 2), it is proved that 


one (=) (2k + Ahr (= ; 


k>0 


+ 
~. 


=: c4TS;,(Y) + O(Y), (64) 


where h(x) := x7h(x) and Y := 2p"/?/c. We apply the Euler—Maclaurin summation 
formula to the first term, yielding 


NIN 


In differentiating the expression 


e(rsin (3) i( cos (**)) en 


Maass Waveforms and Low-Lying Zeros 41 


k times, the worst case is when we differentiate the exponential every single time 
: k : ; 
and pick up a factor of (=) ; otherwise, we gain at least one factor of T (remember 


that Y should be thought of as order T”). Hence we may bound the error term by 


(«(v sin (22)) (= cos (%))) "ar < (FZ) 


Taking M > 1+ = the error term is thus O (Y/T). 

Next, by the same analysis, in the second term of (65) we either differentiate the 
exponential every single time, or we gain a factor of T from differentiating h or one 
of the cos (17a/T)’s produced from differentiating the exponential. Thus, since we 
differentiate at least twice, all but one term in the k-fold derivative is bounded by 
Y/T?. The last remaining term, obtained by differentiating the exponential k times, 


vanishes because h has a zero at 0. 
Hence it remains to bound the first term of (65). This we do by integrating by 
parts, via 


«) —~_ 1 fw (f@) 
Je f(xjdx = ai e g'@) dx. (68) 


nS 


K =. (69) 
Hence we obtain the bound 
Y 
Sr(Y) <« i (70) 


Thus 


Anp'/? 2k +1 ve 
Teds ( P ) ek + Dr ( ; ) et, (71) 


k>0 
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That is, this tells us that 


2 
Bs log p 3 Llogp soe 
p'/?logT” \ 2logT = c 


e/2 
Te (F2—) ek + dir 7 “i) 


k>0 
logp Llogp 
i Panes 2log T gone 


« 3 Ds ee 


£=1 jee 


1 heal 


< : 
log T 


(72) 


For 7 < 1 this is negligible upon division by the total mass (which is of order T’), 


completing the proof. 


4.3 Proof of Theorem 2.4 


oO 


Before proceeding it bears repeating that the same limiting support as K gets large 
(namely, (—1,1)) can be achieved by just trivially bounding as above (that is, 
without exploiting cancelation in sums of Kloosterman sums), but we present here 
an argument connecting the Kuznetsov formula to the Petersson formula as studied 


in [27] instead. 
It suffices to study 


2 
logp »/(logp S(p*,. 1: c) 
SF = 
Dare ae dX, c 


Ye pide (2 =) ok + nite (4 “i) 


k>0 


and bound .Y by something growing strictly slower than T?. Let 


Ok(m;c) = 2x PSV: Oe (=e*) 3 (8) 2logp 
P 


logR 


/p log R’ 


(73) 


(74) 
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exactly as in [27]. Then this simplifies to (dropping overall constants) 


2k+1 
Let Dees pa ( S ') Or s2lt30 (75) 


2T 
c>1 k>0 


if we ignore the £ = 2 term, which is insignificant by, e.g., GRH for symmetric 
square L-functions on GL3/Q (as in [27]). 
In Sects. 6 and 7 (see (7.1)) of [27] they prove 


Theorem 4.1. Assume GRH for all Dirichlet L-functions. Then 


Ox (mc) K fe(ZmT"k (log(2c))* (76) 
where 
(2) Q2* k>3z (77) 
4 
ut k-/2_ otherwise 
and 
4nT" 
—— a (78) 


Hence the sum over c now converges with no problem, and we may ignore it. 
What remains is 


T!) \(QQk+ 1)H (= ue ‘) Ve(zké 


a 2T 
Ite py (2k41 
= et4,, (2k 41. ki¥<H (2°) 
O<k « TI Tl <k 


The second term in parentheses poses no problem. For the first term, using 
H(x) « xX for 0 < x < 1, we see that the above is bounded by 


T! oy pet atk eK <K T31-K (80) 
O<k « T7 
Thus we need 
2+K 1 
n< =4 (81) 
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Again, by taking K even larger we could have just trivially bounded throughout and 
not invoked [27] or GRH, but the connection noted above may be of independent 
interest. 


4.4 Proof of Theorem 2.5 


By definition 


D2(u, b1,$2,R) = Di(u, $1, R)Di(u, $2, R) 
— 2D, (u, o1b2, R) + 5,11 (0)b2(0). (82) 


Averaging, we see that for 1 and b> of sufficiently small support (actually (-3, +) 


would work fine), given our results above on one-level densities, up to negligible 
error 


D»(Ai, 61, ¢2,R:wr) = Avg (rv o1, R)D (u, b2, R); wae) 


— (1 —-V (-1))91(0)2(0) — 261 * $2(0) + o(1). (83) 


Now 


D(u, 1, RD (u, g2,R) = 


eat 7 2) (0) < 2logp ~ (Llogp 
$1 (0) as Anh 
(a 2 -2 ee pilogR er logR 


log(1 +t @(0) < 2logp . x (cee) 
0 + = i, . (84 
(Fo) EE 4 BO yy OEr yh (GREE) 


Since wr is supported essentially around ¢,, = T, the log(1 + #2) terms are all, up 
to negligible error, approximately log R (again we invoke the bound of [58] on the 
L?-norms occurring in the denominator). Also by an application of the Kuznetsov 
formula, now with the inner product of two Hecke operators (namely T,¢ and Tj 
for p, g primes and 0 < £, £’ < 2—this uses the results on the Bessel—Kloosterman 
term established above), we see that the resulting average is, up to negligible error, 


wr(tu) 
[|u| |? :) = - 


$1 (0) $2(0) 4log*p ~ (Llogp\ ~ (Llogp 
(fort ) (# ae re +>y4 ai( logR a ( logR ) 


22 og R® 


Avg (rv $1, R)D1(u, b2, R); 


(85) 
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That is, only the diagonal terms p’ = q’ matter. Now partial summation (and the 
prime number theorem, as usual) finishes the calculation. 


Appendix 1: Contour Integration 


We prove Proposition 3.3 below. We restate it for the reader’s convenience. 


Proposition 5.1. Let T be an odd integer and X < T. Let wr equal hy or Hr, where 
these are the weight functions from Theorems 2.2 to 2.5. Then 


1 
i J rir(X rar = cr 01) nei (X) (2k + Dwr ((«+ 5) ) 


k>0 


+ 6oT? Yo (-1) oer (XP) (86) 
kel 
= ¢ SoD ants (X) (2k + I)wr ((é + 5) ) 
k>0 
[+0 (Xe~7)], (87) 


where C1, C2, and c3 are constants independent of X and T, and the terms in brackets 
are included if and only if wr = hr. 


Proof. In the proof below bracketed terms are present if and only if wr = hr. 
Recall that 


_ (- 1 yny2mtro 
1) = areal) = 


m>0 


By Stirling’s formula, (m+ 2ir+ 1) cosh(srr) > |m+ 2ir+ ymtaem™, Hence by 
the Lebesgue Dominated Convergence Theorem (remember that wr(z) is of rapid 
decay as |{tz| + oo) we may switch sum and integral to get 


i: Ji (X )\——— sas, dr 


cosh(zr) 
= (pn x7" rwr(r) 
7 dX m! [ I'(m+ 14 2ir) ein (89) 


where X =: 2x. 
Now we move the line of integration down to R— ik, R ¢ Z + ; (and ¢ TZ if 
wr = hr). To do this, we note the estimate (for A > 1) 
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x "rwr(r) 2R 
mm Ii 
i > iR I'(m + 1 + 2ir) cosh(zr) ax f >+A ° as ad 


+1)-""2 wr) [dr 
< rrA”, (90) 


where again we have used the rapid decay of wr along horizontal lines, and B — 
B —iR denotes the vertical line from B € C to B—iR € C. (Rapid decay also ensures 
the integral along R — iR converges absolutely.) 

Note that the integrand 


2ir 
sama, (91) 
I'(m+ 1+ 2ir) cosh(sr) 
has poles below the real axis precisely at r € —i(N + 5) = {-31,- i,...}, and, if 
wr = hr, poles also at r € —iTZ*. The residue of the pole at r = i (k => 0) 
is, up to an overall constant independent of k, 
2k 2k+1 
—1)'2k+1 i). 92 
Fim +14 k+ D0) Okt wr( 2 ) (92) 
If wr = hr, the residue of the pole at r = —ikT (k => 1) is, up to another overall 


constant independent of k, 


2kT ( ) 4 2kT 


ky,272 = 
D'(m + 1+ (2kT)) “) cos(kT) (m+ 1 + (2kT)) 


RT*h(k). (93) 


Hence the sum of (89) becomes 
> (—1)"x?" i x" rwr(r) 
m! I'(m+ 14 2ir) anon 


= 3 (—1)"x2" y 2k+1 Crytar+ tw 
: ml! C+ 1+OkS 1) . 


m>0 O<k<KR 
2k+1, Z [ x" rwr(r) 
a a fGie i + mcohGny 
(—1)"x2" 2kT ce 
+e » > (—1)'k?T*h(k) | J. 
m>0 m! O<k<KR r(m + | + (2kT)) 


(94) 
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Now we take R — oo. Note that 


x2it ‘rwr(r) OR N 
ral re 
[ in (m+ 1 + 2ir) cosh(zr) as oe Vln) lees | : 


Km Re ROM IR (95) 


again by Stirling and rapid decay of wr on horizontal lines (that is, wr(x + iy) < 
(+ x)~ 4e%r since both h and H have all their derivatives supported in (-;. +))- 
This of course vanishes as R > oo. 


Hence we see that 


i pee a 
R 


cosh(zr) 
_ (—1)"x2”" ektl k+1 
ad «De m! 2 Tom+1+Qk+ py DKF Dwr ( ' 
(- 1 iat a 4 2kT ; 
+o), m! d Tim+1+ a) * Pay: (96) 


Switching sums (via the exponential bounds on wy along the imaginary axis) and 
‘ __ym,2m-+n . - : 
applying J,(X) = ee, or gives us the claimed calculation. For the bound 
on the bracketed term, use 


n na 1—y? 


ae saa 
Upyl=yy 


Jn(ny) « (97) 


(see [1], p. 362) and bound trivially. 


Appendix 2: An Exponential Sum Identity 


The following proposition and proof are also used in [2]. 


Proposition 6.1. Suppose X < T. Then 


SH(X) : 


II 


Va Xe 1)‘Jont (X)— 


k>0 in (24) 


Ciol Yie (v sin (= =)) 2 (= cos (“*)) + O(Y), (98) 


lal<F 


II 


where Cio is some constant, g(x) := sgn(x)h(x), X =: 27Y, and for any f set 


FQ) := af). 


48 L. Alpoge et al. 


Proof. Observe that k +» sin(zk/2) is supported only on the odd integers, and 
maps 2k + 1 to (—1)*. Hence, rewriting gives 


- zfk\ sin (2) 
S)(X) = T Me Ii(X)h (==) sin (28) (99) 
k@2TZ. 
Since 
! 4 T-1 
sin (4) ee - et - mike 
. a ik nik aad (100) 
sin (=) en —o a 
a=—(> 


when k is not a multiple of 27, we find that 


Sy =T>Y ye e(F) TAXh (<7). (101) 


T k>0 
lal<> pare 


Observe that, since the sum over @ is invariant under ~w +» —a (and it is non-zero 
only for k odd!), we may extend the sum over k to the entirety of Z at the cost of a 
factor of 2 and of replacing h by 


g(x) := sgn(x)h(x). (102) 


(This is because of the identity J_,(x) = (—1)*J;(x) and the fact that we need only 
consider odd k.) Note that g is as differentiable as h has zeros at 0, less one. That 
is to say, g decays like the reciprocal of a degree ord,=o h(z) — 1 polynomial at oo. 
This will be crucial in what follows. 

Next, we add back on the 2TZ terms and obtain 


TY De(F) none (=F) — 1D rrcorenay 


lal<t keZ keZ 


550%) 


ll 
| 
Mm 
—— 
N] = 
S18 
See 
= 
om 
— 
N 
ape 


=) +0 (xe sat) 


=: V)(X) + O(Xe~""), (103) 


where we have bounded the term T? )°,¢7 Joxr(X)K°A(k) trivially via J,(2x) <« 
x"/n!. 

Now we move to apply Poisson summation. Write X =: 27 Y. We apply the 
integral formula (for k € Z) 


J,QQmx) = ie e (kt — x sin(271)) dt (104) 


1 
2 
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and interchange the sum and integral (via rapid decay of g) to get that 


V(X) = T > i (=-(F x i) 2 ()] e(—Ysin(2rt)) dt. (105) 


keZ, 
By Poisson summation, (105) is just (interchanging the sum and integral once 
more) 


VO) >) a ; @" (2T(t—k) +a) e (—Y sin(2z1)) dt 


1 
lal<4 keZ 2 


= ent > 7 2” (He (¥ sin (= + =)) dt 


ree T T 
=: cyW,(X). (106) 
As 
sin (= + =) — sin ( 7 ) + = cos (=*) _ as sin (=) +0 . : 
(107) 
we see that 
W,(X) 
= ¢3T > e (Ysin (=) [- 2" (the (= co (“*)) di 
lal< = 
Y a= 
re ee . )) (>) [. Pe" (the (. (“")) dt 
Yo 
+0(7+5) (108) 
= ost D> e(rsin(32)) 3 (00s (32)) 
lal<> 
ate x e (¥sin ( )) sin (=) (= cos ( )) 
lo|<Z 
¥ - 
+0 (5 + =) . (109) 


Now bound the second term trivially to get the claim. 
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Appendix 3: 2-Level Calculations 


The purpose of this appendix is to provide additional details to the 2-level 
computation of Sect. 4.4. Letting 


Sea ee (ee ) 
p 


p'/2logR logR 

21,2(u;)logp . (2logp 
S2(uj, 9) = as i 110 
207.) = OTTER i (e) (110) 


and summing over the family, a standard calculation reduces the determination of 
the 2-level density to understanding 


hr (tj) io 
-_ > 2. Hl 
Lite 7 ||| 


Tee +H 


SO Wy, 6) — SS (uj, 6) 


2 
o (ee) | (111) 
logR 

(the other terms are straightforward consequences of the combinatorial book- 

keeping from the inclusion—exclusion argument), where c(1) is the identity map 
and c(2) denotes complex conjugation. 

We can move the factor O(loglog R/ log R) outside the product at the cost 

of an error of the same size outside all the summations. To see this, since 

O(log log R/ log R) is independent of u; these terms are readily bounded by applying 


the Cauchy-Schwarz inequality. Letting .” represent any of the factors in the 
product over i in (111), the product involving this is O(log log R/ log R): 


3 in) g 0 (“ece*) 
ari) a7 
Li z iP j lil ne 


; 1/2 1/2 
hr(t) 6 loglogR ir) g 
< 7 a ~~ lu; 2° logR : = a », \lu; ph” 
i Tal? i J Gj lull J 
1/2 
log ~e) in) gy 
« o( . (112) 
logR y my 2 || 14) aa 
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We now analyze the four possibilities for the sum involving |./|?. If Y is either 


log(1+17 glog : Fe pelea ; F 
$i (0) eae R oro (es ot), this sum is trivially O(1), and thus the entire expression 


is O (see). We are left with the non-trivial cases of Y = S, or -Y = S>. To 


handle these cases, we rely on results that we will soon prove in lemmas below: for 
sufficiently small support |S;|? = O(1) and |S2|? = o(1). Note that we use these 


lemmas for ¢ = ¢)¢, instead of the usual 6 = ¢)¢2, but this does not affect the 


proofs, and thus we may move the O (sees 


) factor outside the product. 
By symmetry, it suffices to analyze the following terms to determine the 2-level 


density: 


thi) og(1 + #) 
5 may > POH (se?) 


1 : yr in) § log(1 + #7) 


Se | a i ea a 


1 h 
a AO Ca, br Seley, da) K€ € {1,2}. (113) 
a me j i“ 


We now analyze these terms. For small support, a similar analysis as performed 
earlier in the paper shows that the first term in (113) satisfies 


hr(t)) ~ og(1 + t7) = 1 ) 
0 0) | ———— ] = 0 0 O 
y a 2 | mek )pr( (5 logR ay = $1( )pr( ) + leeloek 


(114) 
We next handle the terms where we have exactly one S-factor. 


Lemma 7.1. For sufficiently small support, the 1 (0)S\ (uj, 62) and bi (0)S2(uj, $2) 
terms are O(log log R/ log R), and thus do not contribute. 


Proof. The proof is almost identical to the application of the Kuznetsov trace 
formula to prove similar results for the 1-level density, the only change being that 
now we have the modified weight function hr(¢;) log(1 + t?). 


We now turn to the more interesting terms in (113). We first handle the second 
term. 


Lemma 7.2. For sufficiently small support, 


h — loglogR 
= way LOS SC) = 2 c eb (dateyae-+ 0 ( SEPE*) 
J uj? J 


(115) 
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Proof. 


h gos 
»» iE Si} (uj, 1) S1 (uj, 2) 


_ 4y> hr(t)) 3 Ap, (uj)Ap, (uj) log pi DEP G, a, Gea) 
; || «lI es pip Sie log?R logR logR 


logp, logp2 . (logp; log p2 hr(tj) —— 
=a » om Ri log’ R 1 logR logR a ull? An (uj)Ap, (uj). 


Pl po P 


(116) 


As before, we apply the Kuznetsov formula to the inner sum. Since the formula has 
a 5p, p>, We need to split this sum into the case when p; = pz and the case when 
P1 # P2. For small support one easily finds the case p; 4 p2 does not contribute; 
however, the case pj = pz does contribute. Arguing as in the 1-level calculations, 
after applying the Kuznetsov formula we are left with 


1 log” p » (<2) * (22) ic (eee) 
* =2 z\d1(z)2(z)dz + O{ ——— ]. 
dX P log? Pa logR bo ing - I<lbi (<)b2(2) an 


(117) 


(the equality follows from partial summation and the Prime Number Theorem, see 
[42] for a proof). 


Lemma 7.3. For small support, the contribution from the S;.(uj, 61) S¢(uj, 62) terms 
is O(log log R/ log R) if (k, £2) = C1, 2) or (2, 2). 


Proof. The proof is similar to the previous lemma, following again by applications 
of the Kuznetsov trace formula. The support is slightly larger as the power of the 
primes in the denominator is larger. 
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Pour Helmut Maier, qui compte 
par plaisir et partage sans compter. 


Abstract Extending a previous result, we show that, for the friable summation 
method, the Fourier series of any normalized function F' with bounded variation 
on the unidimensional torus converges pointwise to F while avoiding the Gibbs 
phenomenon. We also prove that the convergence is uniform when F is continuous 
and provide an effective bound for the rate when F satisfies a uniform Lipschitz 
condition. 


Keywords Friable integers * Friable summation * Summation methods ° 
Fourier series * Gibbs phenomenon ¢ Functions of bounded variation * Jordan’s 
theorem ¢ Integers free of large prime factors 


2010 Mathematics Subject Classification. 
Primary: 11N25, 42A24, secondary: 42A20. 


1 Introduction et énoncé des résultats 


Soit VB*(T) la classe des fonctions F qui sont 1-périodiques, 4 variation bornée 
sur T = R/Z, et normalisées par F(0) = 5${F(0+) + F(v—)} aux points de 
discontinuité. Nous nous proposons ici de généraliser aux fonctions de VB*(T) 
le théoreéme 5.1 de [2] relatif 4 la convergence friable des séries de Fourier et 
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a l’absence de phénoméne de Gibbs (voir par exemple, [9], vol. I, § II.9, ou [6], 
chapitre 17) pour ce procédé de sommation. 

Désignons par P(n) le plus grand facteur premier d’un entier naturel n > 1, 
avec la convention que P(1) = 1. Nous notons a,(F), b,(F) les coefficients de 
Fourier d’une fonction F de L'(T), et, pour toute y > 2, nous définissons la suite 
des sommes partielles friables de la série de Fourier de F par 


F(9;y) := ao(F) + )> {a,(F) cos(2n) + b,(F) sin(2xnd)} (1) 


P(n)<y 


pour toute valeur de % ot cela posséde un sens. C’est en particulier le cas si 
F e€ VB*(T) puisque |a,(F)| + |bn(F)| « 1/n, ce qui implique I’ absolue 
convergence de la série (1) pour chaque y = 2. 

L’étude de la convergence de F(; y) lorsque y tend vers l’infini s’inscrit dans 
le cadre plus général de celle de la convergence friable des séries. Formellement 
introduit dans [3] puis dans [4], ce procédé de sommation! consistant 4 définir 
la somme comme limite, lorsque le paramétre de friabilité tend vers l’infini, des 
sous-séries restreintes aux entiers friables, a été étudié dans [1] puis de maniére 
systématique dans [2]. Au théoréme 5.1 de [2], nous avons établi la convergence 
simple de F(%; y) vers F(%) pour toute fonction F de VB*(T) sous réserve que 
la dérivée presque partout de F soit dans U,.,L%(T). Le but de la présente note 
consiste a relacher cette derniére condition. 

Il est relativement facile d’exhiber des exemples de fonctions discontinues F 
appartenant 4 VB*(T) ~ UgsL"(T): sif € L! (T) ~ Ugs1L%(T), on peut choisir 


o) 
F(#) =| f(t) dt + BB). 


Un exemple de fonction f admissible est fourni par 


fF) = 3 cos) (2) 


logn 
n22 & 


En effet, un théoréme classique concernant les séries trigonométriques 
Ynez dn e27? dont la suite des coefficients est paire et vérifie identiquement 
An—1 tAnt1 = 2an (voir [5], théoréme 1.4.1, ou [9], vol. I, théoréme V.1.5) implique? 
f € L\(T) alors que, pour tout a > 1, posant r := min(2,a), s := r/(r — 1), 
Vhypothése f € L*(T) impliquerait f € L’(T) et donc (cf., par exemple, [5], 
théoréme [.4.7) 


'Désigné dans ces travaux sous le nom de P-convergence ou P-sommabilité. 


?On peut aussi établir directement via la formule d’Euler—Maclaurin que, pour 0 < |#| < $s ona 


FO) K 1/{9 (log 9)?}. 
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YFP < c, 


né€Z 


une condition manifestement en défaut pour la série (2). 
Nous désignons par @ la fonction de Dickman,’ posons ||v|| := minnez |v — n| 
(v € R) et notons 


oy sin(27n1) (3) 


BO; y) = — a 
P(n)<y 


la somme partielle friable d’ ordre y de la série de Fourier de la fonction de Bernoulli 


_yosin2amd) — { (8)-1sid €Z, 
AOS =| 0 side, o 


m21 


oti (2) désigne la partie fractionnaire du nombre réel v7. 


Théoréme 1.1. Soit F ¢ VB*(T). On a, uniformément pour 0 € R et y = 2, 


FO») - FO) = [yo AEP) — 1} ae) ar@—v)+0(—). (5) 
T ogy log y 


En particulier, ona 


lim F(:y) =F) (8 €T), (6) 
yoo 
lim sup F(?; y) = sup F(?). (7) 
YO Her deT 


De plus, si F est continue, alors F(-; y) tend uniformément vers F. 


Remarque. Bien que l’on ait (7), la convergence de F(v;y) vers F(v) n’est 
en général pas uniforme: il est facile de voir que la série F(¥;y) converge 
uniformément et donc que, pour chaque y > 2 fixé, F(v;y) dépend continiiment 
de v. Il est cependant 4 noter que, contrairement 4 la situation classique, la 
convergence demeure exploitable au voisinage des discontinuités : alors que l’on 
sait, classiquement, que la somme partielle d’ ordre y de la série (4) n’ approche pas 
le membre de gauche pour, par exemple, } = 1/y, on déduit de (5) que 


B(/y;y) -— B(l/y) K€ l/logy = (y 2 2). 


3Voir par exemple [7], chapitre III.5. Rappelons que @ est la solution continue de 1|’équation 
différentielle aux différences ug’(u) + o(u — 1) = 0 avec la condition initiale g(u) = 1 
(0 <u 1). 
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Sous une hypothése supplémentaire de régularité pour F, nous pouvons préciser 
la vitesse de convergence de F(-; y) vers F. 


Théoréme 1.2. Soita > 0. Si F € VB*(T) est uniformément lipschitzienne 
d’exposant a > 0, ona 


1 
max |F@; y) -—FO)|<« yr (y = 2). (8) 


2 Preuve du Théoréme 1.1 
Pour toute fonction F de VB*(T) et tout entier n > 1, ona 
a,(F) cos(22nt) + b,(F) sin(2anvd) = 2 | cos(27n(3 — v))F(v) du 
T 


— [ BORO) aeeyy = — f PCE arco — vy, 
a T wn 


wn 


Cette quantité étant trivialement < 1/n, on peut sommer pour P(n) < y. On en 
déduit que l’on a, pour tout y > 2, 


F(:y) = ao(F) + f B(o:y) AF —v). 0) 
Par ailleurs, une simple intégration par parties fournit, dés que F € VB*(T), 
F(d) = a(F) + f B00) AFCO ~). (10) 
En effectuant la différence de (9) et (10), nous obtenons 
Fv») ~ FO) = [ View: AF ~v), (1) 
ot l’on a posé 


Vi(d; y) = BCD; y) — BY). (12) 


Or, nous avons établi dans [2] que l’on a, uniformément pour 7 € R, 


1 
Vi(:y) = falus,) — BC) + O(——). = 
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oll nous avons posé 


= PEMD gy eRdZ) 


logy 


et convenu que us, = oo sid € Z. Il s’ensuit que 


Fi») ~ FB) = [ felut,) ~ 19800) AFH —v) + Or(1/Togy). (14 


Comme {o0(u; ,) — 1}B(v) tend simplement vers 0 sur T, la relation (5) implique 
clairement (6), en vertu du théoréme de la convergence dominée. 
Montrons maintenant (7). Il résulte immédiatement de (6) que 


lim sup sup F(0; y) > sup F(?). 
5) 


yoo v 


Il suffit donc de prouver l’inégalité inverse. Nous pouvons supposer sans perte de 
généralité que ap(F) = 0. Posons 


F(0) = | out, )B(o) AF —v), 
de sorte que, par (10) et (14), ona 


sup |F(0; y) — Fy()| K 1/logy (vy > 00). 
0 


Comme I’application v +> o(u;,,)B(v) est continue sur T, F,)(#) est, pour chaque y, 
une fonction continue de v. Elle atteint donc son maximum en un point ¥,. Quitte a 
extraire une sous-suite, nous pouvons supposer que 


lim sup sup F(v; y) = lim, Fy(d,) 
yoo 7 


yoo 
et que vy tend vers une limite J) € T. Quitte 4 changer F(v) en F(0 — Vo), nous 
pouvons encore effectuer Il’hypothése que % = 0. 
Soit 6¢ := F(0+) — F(0—) le saut de F en 0. Posons 
H(d) := 14+ [0] — 5120) 


de sorte que F := F — pH est continue sur Z. Pour 8 € T, nous avons 


F,(9) = [ out, )B(v) dF(9 — v) + bolus, )B(O). (15) 
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Comme oe(u*,)B(v) tend simplement vers B(v) sur T avec une convergence 


uniforme sur tout compact de T ne contenant pas 0, et comme F est continue en 0, 
ona 


lim / o(u;, ,)B(v) dF (0, —v)= [eo dF(—v). 
y> 00 JT , T 
Tl suit 
F,(0,) < / B(v) dF(—v) + 4|8-| + 0(1). 
T 
Or, en faisant tendre y vers l’infini dans (15), nous obtenons successivement 
F(0+) = i B(v) dF(—v)F45p,  max{F(0+), F(0-)} = / B(v) dF(—v) +4 |6rl. 
T T 


On en déduit que 


Fy (0) < max{F(0+), F(O—)} + o(1) < sup F(#) + o(). 
5) 


Cela achéve la preuve de (7). 
Lorsque F' est continue, et puisque o(v) = 1 pour 0 < v < 1, la relation (5) 

implique 

o+1/y 


“dF \() + O(—), 


FO: - FO f on 


ot la constante implicite ne dépend que de F. Comme F est continue, et donc 
uniformément continue en vertu de la compacité du tore, il en va de méme de sa 
variation. La derniére intégrale tend donc vers 0 uniformément en ?. 


3 Preuve du Théoréme 1.2 
Sous l’hypothése que F est a variation bornée sur T et satisfait une condition de 
Lipschitz uniforme 
F(0 +h) — F() « |hl®, (16) 


ot. a > 0 et la constante implicite est indépendante de 3 € T, un théoréme de 
Zygmund ([9], vol. I, th. VI.3.6) implique que la série de Fourier de F, i.e. 


F(0) = Do en(F) (nd), (17) 


neZ 


avec la notation familiére e(f) := e?”" (t € R), est absolument convergente. 
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En fait, la démonstration de Zygmund fournit, sous la seule hypothése que F est 
a variation bornée, la majoration 


Y len) <5 VVeor(1/2"*) (ve NY), (18) 


21 <|n| <2” 


ou Vp désigne la variation totale de F sur T et h > w,(h) son module de continuité. 
Pour la commodité du lecteur, nous reproduisons ici la courte preuve de [9]. 


Soient v > 1, N := 2’t!. Pour tout 3 € T, nous avons 


Y |e(o+ x) -F(0 + )| < veor(—). 


1<k<N 


En intégrant sur T, nous obtenons 


Nn 1 
4n YS \en(F)|? si (2) <V, (<). 
2 lent )P sin? (=) < Vewr(— 


Nous en déduisons (18) en observant que, pour V/4 < |n| < N/2, nous avons 
J2/2 < | sin(n/N)| <1 


et en appliquant I’ inégalité de Cauchy—Schwarz. 
Sous la condition (16), nous déduisons immédiatement de (18) que 


1 
VlaPi«Krsz We. (19) 
y 
|n|>y 
Posons 
Fy(v?) = > cn(F) e(nt), 
In|<N (N>1,0€T, y= 2). 
Fy(O:y):= SY) en(F)e(nd) 
|n|<N 
P(\n|)<y 
Pour tous } € T, N = 1, nous avons, par (11), 
1 
Fy(8:y) —FvyO)=— DY en(F)e(nd) K 5 (20) 
|n|<N y 
P(\n\)>y 


ott la majoration découle de (19), la constante implicite étant donc indépendante 
de N. 
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Comme F € VB*(T), onac,(F) < 1/n pour tout n € Z*, donc Fy(v; y) tend 
simplement vers Fv; y) lorsque N tend vers l’infini. D’aprés le théoréme de Jordan, 
il en va de méme de la convergence de Fy(#) vers F(). Nous obtenons donc (8) 
en faisant tendre N vers l’infini dans (20). 
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On Conjectures of T. Ordowski and Z.W. Sun 
Concerning Primes and Quadratic Forms 


Christian Elsholtz and Glyn Harman 


On the occasion of Helmut Maier’s 60th birthday. With 
admiration for his beautiful results on the distribution of primes 


Abstract We discuss recent conjectures of T. Ordowski and Z.W. Sun on limits of 
certain coordinate-wise defined functions of primes in Q(./—1) and Q(./—3). Let 
p = 1 mod 4 be a prime and let p = a, + Cs be the unique representation with 
positive integers a, > b,. Then the following holds: 


k m/4 
lini ee mod 4 4p _ Jo cos* (x) dx 


Noe Do en eat moda Op is * sink (x) dx 


For k = 1 this proves, but for k = 2 this disproves the conjectures in question. 
We shall also generalise the result to cover all positive definite, primitive, binary 
quadratic forms. In addition we will discuss the case of indefinite forms and prove 
a result that covers many cases in this instance. 
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66 C. Elsholtz and G. Harman 
1 Primes and Quadratic Forms 


Tomasz Ordowski (see Sun [12], Sect. 6) conjectured: 


Conjecture 1.1. Let p = 1 mod 4 be a prime and let p = a) + by be the unique 
representation with positive integers a, > by. 


= a, 
(a) lim Dcheeil mod 4 “P = {+ 2 
N00 pen p=1 mod 4 bp 


2 
(b) linn och mod 4 ap = 9 
eG 
N-0o ae p=! mod 4 b; 2 


Z.W. Sun [12] stated a number of related conjectures for other quadratic forms, 
including the following: 


Conjecture 1.2 (Sun). Let p = 1 mod 3 be a prime and let p = x7 + xpyp + y, be 
the unique representation with positive integers x, > yp. 


by 


; N, p=1 mod 3 * 
(a) in ee Sa, 
N00 a mod 3 Yp 


(b) itn ae p=1 ile % = 52 


7 = . 
N-00 Dpenk p=! mod 1 Yp 9 


This conjecture can also be found in the comments on sequence A218585 in [10]. 
Sun further remarked that numerically 

3 4 
aval mod 3 Xp ee mod 3 Xp 


3 © 11.15 and lim 


~ 20.6. 
N08 en nal mod 1 yp 


lim 

N=. 29 Leen mod 1 

In this paper we will show that these limits exist and how one can evaluate 

them. Cases (a) of both conjectures turn out to be true, but cases (b) of both 

conjectures are wrong. It seems to us that the authors guessed the values based on 

some experimental data, but without theoretical justification. For the two quadratic 

forms above we will determine the nature and value of the sums for k-th moments. 
Throughout the text k denotes a positive integer. 


Theorem 1.1. Let p = 1 mod 4 be a prime and let p = a, + be be the unique 
representation with positive integers a, > by. Then 


k /4 
he: lim Deva mod 4 ap ie cos‘ (x) dx 
k= = 


ko pr/4_. ; 
N00 iD<N .p=1 mod 4 bi, i / sink (x) dx 


For k = 1 this value is indeed 1 + ./2. For k = 2 the value is at which is about 
4.50388. Here is a small table of exact and numerical values. 
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k | Exact value Approx. numerical value 
1 j1+/2 2.41421 
2 

ge 4.50388 
nx—2 

3 | 3(6+ 4/2) 7.61204 
3x +8 

4 12.2298 
3a —8 

5 | (43/2)/(64 — 43/2) = 43.(43 + 32/2) 19.0701 
15m + 44 
— 29.1700 

©. || teem uD 

T | (177/2)/(256 — 1774/2) = $25,177 + 1284/2) | 44.0371 
Qin + 64 
cuneate, 8612 
21n — 64 al 


Remark 1.1. The method of proof can be easily adapted to prove a slightly more 
general result: Let 0 < C, < C> be nonnegative constants. Let p = 1 mod 4 bea 
prime and let p = a, + by, be a representation with positive integers Cj ay < by < 
C2 dp. Then (counting representations with multiplicity if C) < 1 < Cy) 


k arctan C2 k 
a sepa mod 4 4p _ diane, Cos (x) dx 
LS 
p<N,p=1 mod 4 b 


I(Cy, G) := Ni 
N29 Pe Dp a sin’ (x) dx 


The special case C; = 0, C) = 1 immediately gives Theorem 1.1. 


Remark 1.2. It is possible to investigate the number theoretic properties of the 
integers in the related sequences. For example, the coefficients of 2 in the 
expressions 


240 k=2 


m/4 8+32 k=4 
i cos*(x)dx = 
0 =6 


mts k= 


etc. lead to the sequence 1,3, 10,35, 126,... which is well studied, with a lot of 
further comments and connections stated in the Online encyclopedia of integer 
sequences, sequence, A001700 [10]. 

On the other hand, for k = 2£ we find that J, = (Aga + Be)/(Aew — Be) where 
Ag is the sequence 


1,3, 15,21, 315, 3465, 45045, 15015, 765765, 14549535... 


which is, at the time of writing, not in the OEIS. However, the sequence is very 
closely related to sequence A025547, least common multiple of {1,3,5,...,2n — 
1} : 1,3, 15, 105, 315, 3465, 45045, 45045, 765765, 14549535, 14549535, .... The 
relation to this sequence is quite natural by the recursive nature of the integrals. 
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(Expanding the fractions would enlarge 21 to 105 and 15015 to 45045.) We do not 
follow any of these paths further. 


We now study two properties of the values of ik: 


ak 
Corollary 1.1. Let Ie = limy-soo Z2S%2=!™4% then Ip € Q(/2), with 


= bk? 
D=N p=1 mod 4 “p 
irrational, when k is odd, and Ik € Q(x), with I, transcendental, when k > 2 is 
even. 


The following result states the asymptotic growth of I;: 


Theorem 1.2. As k tends to infinity, we have the following estimate: 


1 
k\2 
cs (=) 2k? () 
2 
We now come to Conjecture 1.2. A consequence of our main theorem below 


(Theorem 1.4) is the following which we state as a result in its own right. 


Theorem 1.3. Let p = 1 mod 3 be a prime and let p = x +XpYp +y, be the unique 
representation with positive integers X) > yp. Then 


Dopevipel mod 3 x _ U(k) 


J, = lim = ; 
N90 Dopehoel mod 3 yk V(k) 
where 
1/6 
U(k) = i (V3 cosx — sinx)* dx 
0 
and 


1/6 
V(k) = 2* / sink x dx. 
0 


In particular this gives 


a 

1 1+73 2.73205 

2 zl 5.78012 
In — 3/3 

3 | £67 + 454/3) 11.1494 
2(./3 —2 

4 2(V3 — 2) 20.5927 
1/3 — 4 

5 | (1837 + 1113/3) | 37.1698 
9/3 — 10 

6 | 2¥3 = 10R 66.2204 
18/3 — 10x 


Again, part (a) of the conjecture is correct, while (b) is not. 
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Theorems 1.1 and 1.3 above, and many others, are consequences of the following 
general result. Before stating this we need to make some comments on the 
uniqueness of representations. In the examples above the representations are unique, 
but it is easy to find forms which give two representations. For Theorem 1.3 if 
we had considered instead Q(x, y) = x* — xy + y” so that O(x,y) = O(x,x — y) 
and thus 0 < y < x is equivalent to 0 < x—y < x. We thus obtain two 
representations for every representable prime. Similarly, if instead of x7 + y* we 
considered (x — y)? + y? = x? — 2xy + 2y’. We note that x* — 3xy + 3y” gives four 
different representations with 0 < y < x of primes p = 1 mod 6. The worst case 
forms are ax” + bxy + cy” with 4ac — b* = 3 and b large and negative. For these we 
have 5 or 6 representations. For example, with Q(x, y) = x? — 7xy + 13y’ we have 


109 = Q(8, 5) = Q(16, 7) = Q(27, 5) = Q(33,7) = O(41, 12) = Q(43, 12) 
(6 representations), 
103 = Q(17, 2) = Q(25, 9) = Q(35, 11) = O(38, 9) = Q(42, 11) 


(5 representations). 


We therefore need to state our general theorem carefully to take this into account. 
The reason for the numbers of solutions occurring will become apparent in the proof 
of the theorem. 


Theorem 1.4. Let Q(x, y) = ax* + bxy + cy’ be a positive definite, primitive (i.e. 
gcd(a, b,c) = 1), binary quadratic form with integer coefficients. Write D = 4ac — 
b?, so D > Oas the form is positive definite. Let 8 = /D, and 


B= in ifa+2b=0, 
arctan(5/(b + 2a)) otherwise, 


with arctanx € [0,2]. Then, for primes p represented by Q(x,y) withO < y < x 
we let Xp, yp denote a solution to 0 < yp < Xp, O(Xp. yp) = p. For all other primes 
we write Xp = Yp = 0. When there is more than one pair Xp, yp) we assume all pairs 
are counted in the expressions that follow. We define Xn, yn similarly for any positive 
integer n. We then have 
R, = lim Rwy tim Sennen a) 
N=>.00 DpeN yp N00 en yh T(k) 


where 


B 
sa) = | (8 cos @ — bsin 8)‘ dé , 
0 
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and 


B 
T(k) = (2a) / sin’ 6 dé . 


We give one further corollary to illustrate the general result. 


Corollary 1.2. Let n € Z,n > 2. Then, in the notation of Theorem 1.4 with 
Q(x, y) = x? + ny’, we have 


Ri =1l+vn+1, 


ie n(./n + (1 +n) arctan ./n) 
> /a + (1 + n)jarctan Jn” 


_— B+ 2n)2 + 3n+ 20+ n)?/?) 
7 3+ 4n , 


R3 


This follows by taking a = 1,b = 0,c = nin Theorem 1.4 which gives 6 = 
2./n, B = arctan(./n). It can also be observed that if n + 1 is a square, then some 
of these limits are indeed rational numbers. 

Z.W. Sun also made a conjecture about the form u, + 3upUp + ie for prime 
p = +1 mod5: 


Conjecture 1.3 (Sun). Let p = +1 mod 5 be a prime and let p = uy, + 3UpUp + uv, 
be the unique representation with positive integers u, > Up. 


=+1 mo Up 
eet d 5 °F = if deal 5. 


p<N, p=+1 mod 5 Up 


lim 
Noo 


He also remarked that it seems that 


ee 


D<N, p=+1 mod 5 UZ 


uw 
: t+ | d5 
lim mee Pw 8.185. 
Noo 


While Theorem 1.4 is only valid for positive definite forms we remark that a 
formal application of the integrals, with a = 1,b = 3,D = V—5 leads to S(1) = 
—3 + /5 and T(1) = 2—V5 which predicts indeed 


> <N, p=+1 mod 5 Up 
lim P="? =14 V5. 
N00 ene mod 5 Up 


Moreover, for k = 2 we find S(2) = —i(/5 —2 artanh(z)) and T(2) = —£(/5— 
4 artanh( A)» which predicts 
2/5 — 4artanh(—L) 
lim us 


N00 ees mod 5 ve J5 —4 artanh(~z) 


2 
=+1 mod 5 U 
pay pk dsp ~ 8.18483. 
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Now there are several potential difficulties when looking at the problem for 
indefinite forms. For example, in the most general case there is not the control of 
variable size in terms of prime size that we have in the positive definite case. At first 
sight there could be issues working in fields with infinitely many units, in particular 
how we count the number of representations. Moreover, the results we need from 
Coleman’s work [1, 2] are not explicitly stated for indefinite forms, although careful 
study shows that he does indeed prove the theorem we require. We thus are able to 
prove the following result which covers the above conjecture and many other cases. 


Theorem 1.5. Let Q(x,y) = ax* + bxy + cy be an indefinite, primitive, binary 
quadratic form with integer coefficients. Write D = b* — 4ac, which we assume 
is not a perfect square. Let 8 = \/D and we assume that 0 < 6/(b + 2a) < 1, 
for if this condition fails there will be infinitely many representations of a prime by 
Q(x, y) withO < y < x. Put k = artanh(6/(b + 2a)) (so this is well defined by the 
previous condition). Then, for primes p represented by Q(x, y) with0O < y < x we 
define Xp, Vp by 0 < Yp < Xp, O(Xp, Yp) = p. When there is more than one pair Xp, Yp 
we assume all pairs are counted in the expressions that follow. For all other primes 
we write Xp = Yp = 0. We define Xn, y, similarly for any positive integer n. We then 
have 


k 
R, = lim Lrsw%p = lim Yin<w%n = U(k) 
N00 Dnew Yk N00 Drew yk — Vk) 


where 
U(k) = i : (5 cosh @ — bsinh 6)‘ dé , 
and 
V(k) = (2a)* i : sinh‘ 6 d6. 


Remark 1.3. We will indicate in the proof how one gets multiple representations of 
primes and why representations are unique for some forms like ur, + 3upUp + es 


We give a corollary working out these integrals for a particular family: 
Corollary 1.3. Let n € Z,b > 3. Then, in the notation of Theorem 1.5 with 
O(x, y) = x? + bxy + y’, we have 


Ri =14+Vb+2. 


pe, - O- DV 4 ~ 4artanh YO= D/O +2) 
o VPA Atte (SDE 


12 C. Elsholtz and G. Harman 


2 Proofs: The Simplest Case 


We begin by quoting a simple consequence of Coleman’s work for Gaussian primes 
which we will use to prove Theorem 1.1 directly and which motivated the whole 
investigation. The reader will see that the proof of Theorem 1.4 is a straightforward 
generalisation of this, though the necessary terminology may at first make it look 
more complicated. 


Lemma 2.1 (Coleman, Theorem 2.1 of [1]). Let 0 < g, < @ < 2m, and0<y< 
x. We can define S = S(x,y, 01, 92) = {z € Zi]: x-y < [z|? <x,91 < GY < Po}, 


where p = arg(z/|z|)*. Let P = P(x, y, 1, @2) = {p € S: |p|* = p, prime}. 
Let ¢ > 0 be given. We have the asymptotic result, 


- 1 
p= B= (14 9(__)), 
2 logx log x 


PEP(x,y.91,92) 
for ~2—-~1 > x 0/24tE y> yl9/24+e > Xe. 
Let us first outline the idea: for asymptotically evaluating ae ee a it 


suffices to dissect the sector with radius N from 0 < g < 2/4 into polar boxes. 
Coleman’s result says that one can dissect this into fine (but not too fine) boxes, so 
that the number of primes, corresponding to p = a? + b’, is asymptotically the right 
number, with some error of smaller order. We can therefore replace summation by 
integration with negligible error. The same is true for the sum in the denominator. 
Hence, we get: 


k m/4 
lim Lies mod 4 a) _ f cos* (x) dx 


k 4. . 
N=7.00 DuneN pul mod 4 by i sin‘ (x) dx 


The reader should note that this is exactly the same relation we would get if p ran 
over all numbers representable as the sum of two squares. 


Proof (Proof of Theorem 1.1). We now describe this in more detail. We write T = 
2[log, N],v = JN. Let us first concentrate on Gaussian primes with modulus in 
the interval (5, v], we later sum up over intervals of type (5;, sr]. For 1 < s < 
3T/4,1 <t < T/2 we then define polar boxes 


By, = {a= re’? : l-—s/T< (r/v)? <1-—(s—1)/T,(t—1)/T < 29/m < t/T} 


t—1 t 
= sa=re?: /NVJ/1—s/T <r<JVNV1-(s—1)/T, me deyat : 
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We note that a polar box {(r,0) : Rj < r < Rj,’ < 6 < 6 + $} has area 
5(Ri _ R3)¢. It follows (something we will need later when we convert sums to 
integrals) that the box B,, has area Nx/(4T7). Write n = (log N)~!. Now logM = 
logN + O(1) for N/4 < M < N. Hence, by Lemma 2.1 (note though that the 
corresponding polar box in that lemma has an angle four times that of B,; ,) for each 
pair s,t we have 


N 
>, I= ican OM): 


PEBs 1 


In each polar box cosy = cos(t2/2T) + O(n) and similarly for sing. Also 
r=v(1—- s/T)? + O(vn). Hence 


a 6% = dG 


N <p<Np=1 mod 4 St peBs; 


II 


(1 + O(n)) Deen y — s/T)? cos(tx/2T) + O(n»). 


nan 


oe 


“oy +0) >> / : (rcosx + O(vn))*r dr dx 


Sit 
reX EBs + 


4 v m/4 
—(n+ oor) | he ar | cos* x dx. 
1s v/2 0 


We remark that it is easy to check the case k = 0 of the above which must give 
the number of primes = 1 mod 4 between N/4 and N. This is sNn(1 + O(n)) and 
equals the final line of the above display by an elementary calculation. 

Similarly, 


4 v m/4 
se by = —(n+ our) | yet ar f sink x dx. 
N a v/2 0 
|Z <PSN.p=1 mod 4 


Adding up over the intervals (+ 
that 


ie = s4r], and cancelling common factors one finds 


k m/4 
fai peioel mod 4 4p __ i cos* (x) dx 


k m/4_. : 
Nene D<N,p=1 mod 4 b, is / sin‘ (x) dx 


Proof (Proof of Corollary 1.1). We now evaluate these integrals, for small expo- 
nents, and determine the arithmetic nature of the values. It is well known that 
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n/4 sin”! xcosx i n—1 2/4 
/ sin"(x) dx = — / sin"? x dx 
0 n 0 n 0 
1 —]1 /4 
= Z sin” * x dx 
n2n/2 n Jo 
ait cos’! x sin x 7 n—1 7/4 
/ cos" (x) dx = ——__— / cos”? x dx 
0 n 0 n 0 
1 KoA ft 
= at oa i cos”? x dx 
n n 0 


We note for the initial values n = 0 and n = 1 that 


m/4 
i sin? (x) dx = 
0 4 
m/4 1 
/ sin! (x) dx = 1- —=, 
0 J2 
1/4 
/ cos?(x) dx = i 
0 4 


m/4 1 
/ cos! (x) dx = —=. 
0 


From this it is easy to calculate for small k the limit by hand. For example, let k = 5. 


ie in? (x) d : 2a : ) lig 5/2) 

Sin’ (x) ax = — _ = —_ e 

5 3.23/2 * 3 ga Ae 

re in’ (x) dx = oe 13 5/2) = : (64 — 43/2) 
ne Map aa ee a 1D ~ 720 


i 30) d 121 5/2 
cos x= = . 
: ra 72 3. 1 


+ 


ie ei dies 1 4 5v2 = 43/2 
0 5.25/25 5 12 120 ° 
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Hence 


k= tin yee mod 4 a, -_ Hi = 43/2 
= = = . 
N00 Yo nen pat mod 42, Ag (64-432) 64 — 43./2 


It is clear from the recursion formulae above, that for odd k, i * cost xdx = 
ri V2, and fr sin’ xdx = ro + 732, where r1,1,173 € Q\{0}. It follows that 
Ik € Q(V2) \ Q. Similarly, as remarked above, when k = 2£ we find that i, = 
(Aga + Be)/(Acx — Br) with integers Az, Be > 0, so the value of /(k) is a rational 
expression of zr, where the z can never cancel, hence /(k) is transcendental, when 
k > 2 is even. 


Proof (Proof of Theorem 1.2). We note that 


m/4 m/2 
/ cos"(x) dx = / cos” (x) dx + O (2-7) : 
0 0 
For n even, say n = 2m, we have by the recursion formula 


[- "a) d _ % (2m—1)(Qm-3)...3 
pe = =), 


_ «a (2m)! 
~ 2 (2m!) 
x (2m/e)?"/4am 


~ "2 22m (m/e)?"270m 


1 
(53) 
Qn} * 
In the above we have used Stirling’s formula to obtain asymptotic formulae for m! 
and (2m)!. Similarly, for odd n we have 


wel m(1+n) 2 TU 5 
"(x) dx ~ ~ . 
i cos” (x) dx ( re ) (=) 


On the other hand, we note that 


r 


sin(/4 — x) = 7 (1—x+ O(@°)). 


Hence the natural logarithm of the function e2*/?(sin(a/4 — x) is 


K log2 + kx + klog(sin(sr/4 — x)) = O(kx). 
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From this we easily obtain 
(sin(7/4 — x) = 2-*/?e* (1+ O(ke)). 
Hence, writing A = 2/4 —k-?/9, w = k!/3, we have 


/4 Xr m/4 
i sink (x) dx = / sin‘ (x) dx + i sin’ (x) dx 
0 0 A 


—k/2 


= 0(2-*/e7H) + : (1+ O0(u')). 


Combining the above results gives (1) as desired. 


3 Proofs: The General Positive Definite Case 


A number of authors, including Landau, Hecke [3, 4], Rademacher [11], Kubilius 
[8, 9], Kalnin’ [6] and Coleman [1, 2], established results with regard to equidis- 
tribution of prime ideals in certain regions. We shall here need to work with the 
distribution of prime ideals and the 1-1 correspondence that exists between these 
ideals and the representation of their norms by positive definite quadratic forms. A 
further complication arises that the discriminant of the quadratic form may not be a 
fundamental discriminant (those given in the next paragraph) and this forces us to 
use a more general result. 

We must now define the notation needed to state Coleman’s theorem in its full 
generality and we will quote this more or less verbatim from [1]. Let Q(vA ) be the 
imaginary quadratic number field with discriminant A or 4A depending on whether 
or not A = | mod 4, respectively, and A is a negative square-free integer. We use 
Gothic letters a, f to denote ideals and p will represent a prime ideal. We write N(a) 
for the norm of a. Given a non-zero ideal f, let g = g(f) be the number of units € such 
that ¢ = 1 mod f. We write K for an ideal class mod f and (€) for the principal ideal 
generated by an algebraic integer €. For each such class we assume there has been 
chosen and fixed an ideal ag € K~'. Then given a € K we can define 4 € ag by 
(a) = ado and &, = | mod f. This algebraic integer is unique up to multiplication 
by the units € = 1 mod f. We write 


— (i) : 


By the definition of g, arg(A(&,)) is unique mod 277. 


Lemma 3.2 (Coleman, Theorem 2.1 of [1]). Given 0 < g, < @ <27,0<y<x 
and an ideal class K mod f. We define S = S(x,y, 91,92,K) = {ae K:x-y< 
N(a) < x, g1 < arg(A(Ea)) < G2}. Let P = P(x, y,91,¢2,K) = {tp € S: N(p) = 
Pp, prime}. 
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Let ¢ > 0 be given. We have the asymptotic result, 


_ (G2 - gidy 1 
= opie (tt liege)» 


for =o > at, pS x > Here h(f) is the order of the abelian 
group of ideal classes mod f. 


peP(x,y,~1,¢2,K) 


Proof (Proof of Theorem 1.4). The proof that 


lim Ye — S(k) 
N->00 ) nn yk T(k) ' 


follows by applying a simple change of variable to map the region 
O<y<x, OXy)<N 
onto the sector 
O<r<N?, 0<6<f£. 
This change of variable will underlie the proof of the more difficult case for primes 


lim 2sn*” _ SCH) 
wee yey ys TR)’ 
D<N “p 


so we shall concentrate on establishing this. 

Let A = —D, and we suppose initially that this is a fundamental discriminant 
with A < —4. Hence there are just the two units —1,1 in Q(V/A). We follow 
Coleman’s argument based on [7] to obtain the 1-1 correspondence between prime 
ideals and points at which Q(m, n) is prime. In Lemma 2.1 we take 


ay = [a, $(b — i8)] . 
The 1-1 correspondence is then 


N(p) = p = Q(m,n) 


b spent 
p — ma n 2 . 


We must remark at this stage that this correspondence is 1-1 between prime ideals 
and m,n and not necessarily between p and m,n. This is why we may get two 


with 
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representations if two distinct prime ideals with norm p fall within the sector we 
are about to describe (which can be larger than the first quadrant if b < —2a). 
We note that |&,|? = ap. So, if we write 


@ = arg). r= VP. 


we have 
— (5cos $ + bsing) 2a—— sing 
m = —— (6cos sing), n= —2a—— sing. 
fad fads 
Now writing 6 = —¢ the condition 0 < n < m translates to 0 < 6 < B with 
”_ (5cos6 — bsin 6) 2a—— sin 0 
m = —— (bcos @ — bsin6), n = 2a—— sin. 
Jad Jad 


We divide the region0 <r <N 2 ,0 < @ < 8 into polar boxes as in the proof of 
Theorem 1.1 and apply Lemma 3.2 to the corresponding regions. As in the proof of 
Theorem 1.1 we can convert the sums to integrals with smaller order errors. When 
we divide one sum by the other the integrals over r cancel, as do various constants 
and the 1/ log N factor, leaving just S(k)/T(k) as claimed. 

Now if A = —3 or —4 we have 6 or 4 units, respectively. We recall that &, is only 
unique up to multiplication by units. In the above argument we can have up to two 
distinct prime ideals and &, could be multiplied by 3 or 2 units leading to different 
values m,n and still remain within the sector under consideration. This leads to the 
multiple representations. Of course, for Theorem 1.1 « = 2/4 constrains & to one 
value, and similarly for Theorem 1.3 k = 7/6, leading to one value. For the form 
x? — Txy + 13y? we have A = —3, « = arctan(—/3/5) = 2.808.... We note 
that 52/6 < « < a and so there will be either 5 or 6 representations depending on 
whether or not all 6 possible values for & lie in the sector. 

Now suppose that D = —f?A with A a fundamental discriminant and continue 
working in Q(./A). If we repeat the above argument, then we would require 


& =mat+n (5*) ; 


We therefore need to restrict ourselves to counting those prime ideals which lead to 
such &,. This corresponds to restricting the prime ideals to a union of ideal classes 
mod f fora suitable f. We give the simplest case by way of illustration. Let Q(x, y) = 
x* + 4y, so D = 16 = f?(—A) with f = 2. Let f = (2). Then there are just two 
ideal classes coprime to f: {(u + vi) : v even, u odd}, {(u + vi) : u even, v odd}. 
Counting only prime ideals in the class with v even will then give p = x* + 4y” as 
required. 
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4 The Indefinite Case 


We first consider how to describe the geometry in the real case. The natural 
embedding from Q(8) into R? is a + bS —> (a, bd). We write ¢’ for the algebraic 
conjugate of € € Q(é6). We shall discover that the polar boxes of the imaginary 
case give way to hyperbolic boxes in this new situation. We then note that there is 
an analogous correspondence between prime ideals and points at which Q(m, n) is 
prime. However we now need to be much more careful about the number of points 
(m,n) corresponding to each prime ideal. We need only deal with the fundamental 
discriminant case as the adaptation to the general case follows as previously. Now 


we write 
£ 4p b—5 
= ma+n{— 
i 2 
2 2 


=u+vd whereu*—Dv? =ra 


Jar(cosh¢@ + sinh@) for a uniquely defined value ¢. 


II 


Note that this gives a 1-1 correspondence between & and (m,n). Following the 
argument of the previous case, if we write @ = —@ then 0 < m < n becomes 


m= Ts (5 cosh 6 — bsinh@), n= 2a sinh 


with 0 < 6 < x. We thus have a hyperbolic box 
{(rcosh 6, rsinh@):0<r< N2,0 <0 <k}, 


which we dissect into small hyperbolic boxes in a corresponding manner to our 
earlier discussion. In particular, where &, occurs we define t(&,) to be the unique 
value ¢ such that 


E, = J/N(a)(cosht + sinht) = /N(a)e’. 


Now &, is only unique up to multiplication by units. Let €9 be the fundamental unit 
of Q(6). Multiplying &, by an even power of €9 gives another candidate for &, but 
shifts @ by 2 log €9. Hence the maximum number of additional representations of p 
from this ideal will be strictly less than 


K 


: 2 
2 log € 2) 


However, there is the ideal containing the algebraic conjugate of &, that is ae to 
consider. Possibly multiplying Es by an even power of € 9 will give a number in 
the required range. So, writing t = t(&»), we would need 0 < —t < x and 0 < 
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t+ 2hlogeo < xk. In the case of x* + 3xy + y” the value of (2) is exactly 1/2 
and so the representation is unique. On the other hand, for x” + xy — y* the value 
is 1. There can therefore be no more representations from the original ideal, but 
if 0 < —t < « then we obtain 0 < t+ 2logeéy < « giving exactly one other 
representation. Of course, it is easy to see there will be two representations for this 
form as the transformation (x, y) —> (x, x — y) leaves the expression unchanged. If 
0 < 6/(2a + b) < 1 fails, then the only restriction on @ is 6 > 0 and so we obtain 
infinitely many solutions by multiplying ) by any positive even power of €o. That 
is why we cannot prove a result of the required form in this case. 

We must now show that Coleman’s work in [2] supplies us with the required 
formula for prime ideals in hyperbolic boxes as above. There Theorem 1.2 is his 
main theorem. We remark that Hensley [5, §5] also gives an explicit account for 
how to count prime ideals in hyperbolic boxes in the case of real quadratic fields. 
We must first describe Hecke characters in Q(6), and we quote Hecke’s original 
definition almost verbatim from [4]. Given an integer 9 of Q(6) we define 


This is clearly a multiplicative function of @ which takes the value | at all units 
(as they are powers of €9, and dividing a unit by its conjugate gives an even power 
of €o). The Hecke characters are then all integer powers of A(-). If N(@) = r, and 
o = r(cosht + sinh f), then 


in Q 


fou 


A(Q) = exp ( log 


log €y 


o’ = r(cosht— sinh?), 


We can therefore use the Hecke characters to pick out the conditiona <t<o+T. 
Since we can investigate the size of the norm using N(@)*(s € C) just as in the 
imaginary case, we obtain the following result from [2, Theorem 2]. We continue to 
use the terminology stated before Lemma 3.2. 


Lemma 4.3. Let ¢ > 0 be given. Given0 < g < gtt<1<xx lt? <1 <1 
and an ideal class K mod f. We define S = S(x,@o,1,K) = {a € K: x(1—T) < 
N(a) S x,Yo S t&a) S Yo + tH. Let P = P(x, Go,t,K) = tp € S: N(p) = 


p, prime}. We have the asymptotic result, 
ys i= xt? (: e o( 1 )) 
pe€P(x,go,t,K) h(f) log x logx 


Here h(f) is the order of the abelian group of ideal classes mod f. 


This supplies us with precisely the formula we need for counting prime ideals in a 
hyperbolic box and the proof can then be easily completed. 
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Large Gaps Between Consecutive Prime 
Numbers Containing Perfect Powers 


Kevin Ford, D.R. Heath-Brown, and Sergei Konyagin 


For Helmut Maier in celebration of his 60th birthday 


Abstract For any positive integer k, we show that infinitely often, perfect kth 
powers appear inside very long gaps between consecutive prime numbers, that is, 
gaps of size 


log p log, p logy p 
(log; p)? 


where p is the smaller of the two primes. 


1 Introduction 


In 1938, Rankin [11] proved that the maximal gap, G(x), between primes < x, 
satisfies! 


log x log, x log, x 


GQ) > (+ (WF 
3 


(1) 


‘As usual in the subject, log, x = log log x, log, x = log log log x, and so on. 
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with c = i. The following six decades witnessed several improvements of the 


constant c; we highlight out only a few of these. First, Rankin’s own improvement 
[12] c = e” in 1963 represented the limit of what could be achieved by inserting 
into Rankin’s original 1938 argument best possible bounds on counts of “smooth” 
numbers. This record stood for a long time until Maier and Pomerance [8] 
introduced new ideas to improve the constant to c = 1.31256e” in 1989; these 
were refined by Pintz [10], who obtained c = 2e” in 1997. Very recently, the first 
and third authors together with Green and Tao [2] have shown that c can be taken 
arbitrarily large. Independently, this was also proven by Maynard [9]. 

Rankin’s lower bound (1) is probably very far from the truth. Based on a 
probabilistic model of primes, Cramér [1] conjectured that 


; G(X) 
lim sup —— = 
X>00 log’ X 


’ 


and Granville [3], using a refinement of Cramér’s model, has conjectured that the 
lim sup above is in fact at least 2e~” = 1.1229.... Cramér’s model also predicts 
that the normalized prime gaps srg should have exponential distribution, that is, 
Pnti — Pn = Clog p, for about e~°x(X) primes < X. 

Our aim in this paper is to study whether or not long prime gaps, say of the size 
of the right-hand side of the inequality in (1), occur when we impose that an integer 
of a specified type lies inside the interval. To be precise, we say that a number m is 
“prime avoiding with constant c” if m + u is composite for all integers u satisfying 


log mlog, mlog,m 
(log; m)* 


|u| < 


Here we will be concerned with prime avoiding perfect powers. 


Theorem 1.1. For any positive integer k, there are a constant c = c(k) > 0 and 
infinitely many perfect kth powers which are prime-avoiding with constant c. 


It seems possible that the methods of Green, Ford, Konyagin and Tao, or of 
Maynard, might be adapted to handle slightly longer intervals containing a kth 
power but no primes. However we leave this possibility aside for the time being. 


2 Sieve Estimates 


Throughout, constants implied by the Landau O-symbol and Vinogradov <-symbol 
are absolute unless otherwise indicated, e.g. by a subscript such as <,,. The symbols 
p and q will always denote prime numbers. Denote by Pt () the largest prime factor 
of a positive integer n, and by P” (n) the smallest prime factor of n. 

We need several standard lemmas from sieve theory, the distribution of “smooth” 
numbers, and the distribution of primes in arithmetic progressions. 
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Lemma 2.1. For large x and z < x1°8*/(0l82) | we have 
g' 


l{n<x:PT(n)<3l<« 


x 
log? x 
Proof. This follows from standard counts of smooth numbers. Lemma | of Rankin 
[11] also suffices. oO 
Lemma 2.2. Let & denote any set of primes and let a € {—1, 1}. Then 
x 1 
{ip <x:p#a (mod r) (Vre &)}| « — I] (1-2). 


log x fer Dp 


pw 


Proof. Standard sieve methods [4]. oO 


Finally, we require a bound of “large sieve” type for averages of quadratic 
character sums. 


Lemma 2.3. For any set P of primes in [2, x], and for any ¢ > 0, 


Proof. This follows immediately from Theorem 1 of [5]. oO 


3. kth Power Residues and Prime Ideals 


One of our principal tools is the following estimate for an average of counts of 
solutions of a certain kth power congruence. 


Lemma 3.1. Let k be a positive integer. For any non-zero integer u and any prime 
p write 


Qur(P) = Qu(p) = |{n (mod p) : n‘ + u = 0 (mod p)}!. 
Then for any fixed ¢ > 0 and x = 2 we have 

y logx 
TT (1-222) co it CE. 
x<p<y ‘a oBy 


The proof of this is based on the Prime Ideal Theorem, and we begin by giving a 
formal statement of an appropriate form of the latter. 
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Lemma 3.2. There is an effectively computable absolute constant c > 0 with the 
following property. Let K be an algebraic number field of degree nx, and write dx 
for the absolute value of the discriminant of K. Let Bo be the largest simple real zero 
of €x(s) in the interval [5. 1] if any such exists. Then 


|zx (x) — Li(x)| < Live®) + e7!x exp{—cng log!/? x} 


for x > exp{10ng log? dx}, where as usual, 1x (x) denotes the number of prime 
ideals of K with norm at most x, and Li(x) = i dt/ log t. We omit the first summand 
on the right-hand side if By does not exist. 


This follows from Theorem 1.3 of Lagarias and Odlyzko [7], on choosing L = K 
in their notation. The reader should note that the counting function mc¢(x, L/K) of 
[7] excludes ramified primes, but the number of these is O(nx log dx), which is 
majorized by x exp{—cng/” log!/? x}. 

In order to handle the term involving the possible simple real zero Bo we use the 
following result of Heilbronn [6, Theorem 1]. 


Lemma 3.3. A simple real zero of €x(s) must be a zero of ¢;(s) for some quadratic 
subfield k of K. 


It follows that Bo is a zero for some quadratic Dirichlet L-function L(s, y), with 
a character y of conductor dividing dx. Thus Siegel’s Theorem shows that 1 — Bo > 
c(e)d,*, for any fixed e > 0, with an ineffective constant c(é) > 0. We then deduce 
that 


Li(xP) « xP < xexp{—c(e)dy* logx} < xexp{—c(e)ng log!/? x} 
if nxlogx = de. We therefore obtain the following version of the Prime Ideal 
Theorem. 


Lemma 3.4. For any n > 0 there is an ineffective constant C(n) > 0 with the 
following property. Let K be an algebraic number field of degree nx, and write dx 
for the absolute value of the discriminant of K. Then 


K(x) = Li(x) + O(xexp{—C(n)ng"? log!/? x}) 


x > exp {max (10nx log? dx, ng'dx)} ; 


Proof (Proof of Lemma 3.1). Since it may happen that —u is a perfect power we 
begin by taking a to be the largest divisor of k for which —u is a perfect ath power. 
Then if —u = v“ and k = ab we see firstly that the polynomial X? — v is irreducible 
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over the rationals, and secondly that n? = v (mod p) implies n* + u = 0 (mod p), 
whence 


Ouk(P) Zz O-vp(p). (2) 


We will apply Lemma 3.4 to the field K = Q(6), where 6 is a root of X” — v. Thus 
K has degree b < k. Moreover its discriminant will be a divisor of 


D := Disc(1, 0, 07,...,02-') = (-1)? 1b’? 
We now set 
xo = C(k,n) exp(|D|"). 


If we choose the constant C(k, 7) sufficiently large, then whenever x > xo we will 
have 


xexp{—C(n)b!/? log!/? x} < xlog-? x 
and 
x > exp {max (10b(log |D|)”, b”"|D\")} ; 
It therefore follows from Lemma 3.4 that 
K(x) = Lix) + Oxy (x log? x) 


for x > xo. 
We now write vx(p) for the number of first degree prime ideals of K lying above 
p. Then 


x(x) =D ve(p) + On D> 1). 


pw pe <x, e=2 


Moreover, by Dedekind’s Theorem we will have 9_,,(p) = vx(p) whenever p + D. 
In the remaining case in which p | D we have 0_,4(p) < b < k and vg(p) < b <k. 
It therefore follows that 


x(x) = Y > O-v,o(p) + Ox(x'/”) + Ox(log |D)), 


pw 


so that 


Y> e-v.o(p) = Li(x) + Ox n(x log x) 


pw 


when x => xo. 
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We now observe that 


I] (1- 22") <exp -») Qu(P) 


x<p<y x<p<y ¥ 


< exp ¢— a O-v,p(P) 
Pp 


x<p<y 


by (2). Assuming that y > x) we may then use summation by parts to calculate that 


y Q-volP) y Q-vb(P) 
zoey «COP ‘iemensy. © 
= log log y — log log (max(x, xo)) + Og,)(1) 
= log log y — log log x — log log xp + Oxy (1) 
= log log y — log log x — nlog |D| + Ox,() 


= log log y — log log x — nk log |u| + O;,(1). 


We therefore have 


m logx 
Il (1-2 2) Ken lull” 28 
Pp logy 


x<p<y 


when y = xo. Of course this estimate is trivial when y < xo since one then has 
logy Kk» |D|” “xy |u|”. The lemma then follows. Oo 


4 Main Argument 


Fix a positive integer k. Let x be a large number, sufficiently large depending on k, 
let c; and c be two positive constants depending on k to be chosen later, and put 
c2x log x log; x 


N= . z= x"! log; x/ logy = = 
Ip Ye (logs x)? 


pw 
In the rest of the paper we will prove the following lemma. 
Lemma 4.1. There is a number m < 2N such that m‘ + u is composite for |u| < y. 


Theorem 1.1 will follow upon observing that m’ < e+?) as x —> oo and 
consequently that 
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log(m*) log, (m*) log, (m*) 
(log, (m*))? 


We will select m by choosing residue classes for m modulo p for primes p < x. Let 


yk 


FP = {p:p < logxorz <p < x/4}, Pr = {p:logx <p < z}. 
We first choose 


m=0O(modp) (pe ), 


m=1(modp) (pe A). o 
Observe that p|(m* + u) if p|u for some p € Y;. Because y < (x/4) log x, any 
remaining value of u is thus either composed only of primes in “> (in particular, u 
is z-smooth), including |u| = 1, or |u| is a prime larger than x/4. For any u in the 
latter category such that p|(u+ 1) for some p € Ao, p|(m* +u). Let U denote the set 
of exceptional values of u, that is, the set of u € [—y, y] not divisible by any prime 
in Y, and such that if |u| is prime then p + (u+ 1) for all p € A). By Lemmas 2.1 
and 2.2, if c; is sufficiently small, then 


y y 1 y log, x Co x 
5 —— 1— < = . 
loge x logx 2) logxlogz cj; logx 


lJUl<« 
pEP2 


Choosing c2 appropriately, we can ensure that |U| < éx/logx, where 6 > 0 depends 
on k (6 will be chosen later). 

The remaining steps depend on whether k is odd or even. If k is odd, the 
construction is very easy. For each u € U, associate with u a different prime 
Pu © (x/4,x] such that (p, — 1,k) = 1 (e.g., one can take p, = 2 (mod k) if 
k = 3). Then every residue modulo p, is a kth power residue, and we take m in the 
residue class modulo p,, such that 


m* =—u (mod p,) (u€ U). (4) 


By the prime number theorem for arithmetic progressions, the number of available 
primes is at least 


x/(2$(k) log x) = |U| 


if 6 is small enough. With this construction, p,,|(m* + u) for every u € U. Therefore, 
m* + uw is divisible by a prime < x for every |u| < y. Furthermore, (3) and (4) 
together imply that m is defined modulo a number N’, where N’|N. Therefore, there 
is an admissible value of m satisfying N < m < 2N. The prime number theorem 
implies that NV = e*+°™, thus m* — y > x. Consequently, m* + u is composite for 
|u| <y. 
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Now suppose that k is even. There do not exist primes for which every residue 
modulo p is a kth power residue. However, we maximize the density of kth power 
residues by choosing primes p such that (p—1,k) = 2, e.g. taking p = 3 (mod 2k). 
For such primes p, every quadratic residue is a kth power residue. Let 


Ps = {x/4 <p <x/2: p = 3 (mod 2k)}. 


By the prime number theorem for arithmetic progressions, | 73| = x/(5¢(2k) log x). 
We aim to associate numbers u € U with distinct primes p, € “3 such that (=) = 


1. This ensures that the congruence m* + u = 0 (mod p,) has a solution. We, 
however, may not be able to find such p for every u € U, but can find appropriate 
primes for most u. Let 


yi = (mH) bx, 
= jueU: | — | = 1 forat most ie primes p € Pr . 
Pp x 


The numbers u € U\U' may be paired with different primes p, € #3 such that 


(=) = 1. We then may take m such that 


m« = —u (mod p,) (u€ U\U’'). (5) 
Next we will show that |U’| is small. Write 
—uy |2 
s=> | (=) . 
ucU pEeP; 


Each u may be written uniquely in the form u = Su; UD, where s = +1, uy > O and 
uz is squarefree. By quadratic reciprocity, 


(2) --a(2) =H). 


since p = 3 (mod 4). Given up, there are at most /y/u2 < ./y choices for u}. 
Hence, using Lemma 2.3, 


rE) 


ucU pEeF3 
2 
1/2 P 
<Dy'4| > (2)| 


u2<y pEP3 


=, yo/2te : 
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Now let 6 = so that 6x/logx < 5|P3|. If u € U’, then clearly 


| s- (=) | > Fal > ae 


per; 


1 
15 (2k)’ 


It follows that |S| >> |U’|(x/ log x)?, and consequently that 
jv <a (6) 


Let A mod M denote the set of numbers m satisfying the congruence condi- 
tions (3) and (5), where 0 < A < M. Thus, if m = A (mod M) and u ¢ U’, then 
m* + wis divisible by a prime < x/2. Let 


K= |] pe. 


x/2<p<x 


We'll take m = Mj + A, where | < j < K, and aim to show that there exists a value 
of j so that (mj + A)‘ + wis composite for every u € U’. By sieve methods (see [4]), 


K 
it: € U': (Mj +A)‘ + u prime} = yo #4 <j<K:(Mj+A)‘ + u prime} 
j=l ueU’ 


«>iK [I (1-22). 


q 
ugU" ycg</K 


By Lemma 3.1, the above product is 


lo 
e/2 
Khe ¥ log K — 


Combined with our estimate (6) for the size of |U’|, we find that 


do Hue U': (Mj + A)‘ + u prime} Kio GR- 
x E 

1<j<K 

It follows that the left-hand side above is zero for some j. That is, (Mj + A)‘ + u 

is composite for every u € U’. Therefore, (Mj + A)‘ + u is composite for every u 

satisfying |u| < y. Finally, we note that Mj + A < 2N, and the proof of Lemma 4.1 

is complete. 


Remark 4.1. For odd k the constant c(k) in Theorem 1.1 is effective. For even k it 
is ineffective due to the use of Siegel’s theorem in the proof of Lemma 3.1. 
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On the Parity of the Number of Small 
Divisors of n 


Kevin Ford, Florian Luca, Carl Pomerance, and Jeffrey Shallit 


To Professor Helmut Maier on his 60th birthday 


Abstract For a positive integer j we look at the parity of the number of divisors of 
n that are at most j, proving that for large j, the count is even for most values of n. 


Keywords Number of divisors 


Mathematics Subject Classification: 11N25, 20K01 


1 Introduction 


Let t(7) denote the number of positive divisors of the positive integer n. It is easy to 
see that t(n) is odd if and only if n is a square, so in the sense of asymptotic density, 
t(n) is almost always even. In this note we consider the function t;(n) = #{d | n: 
d < j}, the number of positive divisors of n that are at most j. Here j is a positive 
integer. Can we say that t;(7) is usually even? Evidently not. This is patently false 
for j = 1, and it is false for all odd numbers n when j < 2. Here’s another trivial 
case. Say n is not a square and n/2 <j <n. Then 1;(n) is odd. In fact, if we list out 
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all of the divisors of n: 1 = d; < dy < +++ < di) = n and choose j at random in 
[1,7], when n is not a square, more than half of the time 1;(7) will be odd, since the 
top interval [n/2,n) takes up half of the available values of /. 

We are interested in the range 2 < j < ./n, showing that 1;(n) tends to be even 
here. Let 


1 + loglog2 


6=1 
log 2 


= 0.08607... . 


Theorem 1.1. Let Nj(x) denote the number of integers n < 
Uniformly for 2 <j < Jx, 


x with t(n) odd. 


x 

No) = 0 Geapreemagiogtpyrae ) 
The theorem implies that when j is large and fixed, t;(7) is usually even as n varies. 

It is interesting to look at this problem numerically. For a fixed number j, whether 
tj(n) is even or odd depends solely on the value of gcd(n, L;), where L; is the least 
common multiple of the integers in [1,j]. That is, (mn) = t1(gcd(n, L;)). Thus, 
the set of integers n with 1;(7) odd is a union of residue classes modulo L;, so the 
asymptotic density of the set of such n exists; it is Nj(L;)/L;. 


j Lj Nj(Lj) Nj(Lj)/Lj 

1 1 1 1 

2 2 1 0.5 

3 6 3 0.5 

4 12 Ti 0.5833333333 
5,6 60 33 0.55 

7 420 225 0.5357142857 
8 840 405 0.4821428571 
9 2520 1305 0.5178571429 
10 2520 1235 0.4900793651 
11,12 | 27,720 13,635 0.4918831169 
13 360,360 177,705 0.4931318681 
14 360,360 170,775 0.4739010989 
15 360,360 170,181 0.4722527473 
16 720,720 359,073 0.4982142857 
17 12,252,240 6,106,815 0.4984243697 
18 12,252,240 5,919,705 0.483 1528765 
19 232,792,560 112,887,225 0.4849262580 
20 232,792,560 109,706,355 0.47 12622903 
21 232,792,560 110,362,725 0.4740818392 
22 232,792,560 107,787,735 0.4630205321 
23,24 | 5,354,228,880 2,496,334,995 0.4662361380 
25 26,771,144,400 | 12,782,443,905 | 0.4774709558 
26 26,771,144,400 | 12,538,223,775 | 0.4683484422 
27 80,313,433,200 | 37,368,330,615 | 0.4652812005 
28 80,313,433,200 | 36,653,106,105 | 0.4563757848 
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Our theorem implies that the right column approaches the limit 0 as j — oo 
slightly faster, at the least, than (log j)~°/C+®, 

In the following table we consider some larger values of j but only via some 
statistical experiments to approximate the density Nj(L;)/L;. The experiments 
involved taking the first 104 numbers following the kth prime, for k = 10°,2 x 
10°,...,6 x 10°. The numbers in the table are actual counts of the number of 
odd values of t;(m) among the 10* values of n. The numbers weakly suggest that 
N;(L;)/L; decays to 0 like (log j)~° where 6 is slightly above 1/2. However, this too 
is misleading. Indeed, we will show below in Theorem 2.3 that Nj(L;)/L; decays 
more slowly than about 1/(logj)*°. We do not resolve the issue of the “correct” 
exponent on log j, but we do give a suggested plan for proving it is asymptotically 6. 


li 10° 2x 10° |3x10° |4x 10° |5x 10° | 6x 10° 

| 100 4131 4121 4077 4099 4123 4109 
200 4061 4107 4174 4181 4231 4050 
300 3800 3850 3954 3980 4002 3969 
400 3630 3703 3800 3744 3877 3875 
500 3466 3587 3673 3710 3793 3772 
600 3351 3512 3526 3594 3722 3682 
700 3294 3435 3502 3543 3627 3593 
800 3213 3301 3431 3475 3577 3574 
900 2822 3245 3337 3411 3522 3477 
1000 | 2358 3197 3248 3334 3459 3439 


Throughout this note, the constants implied the by Landau symbol O and by the 
Vinogradov symbols < and > are absolute. We also use the notationA = BifA <« 
B <A. We also write a||b for positive integers a, b if a | b and ged(a, b/a) = 1. 


2 Proof of Theorem 1.1 


We begin with a criterion for t;() to be even. 


Lemma 2.1. Letj > 2. Suppose that there is a prime p||n and n has no divisor from 
the interval (j, pj]. Then t;(n) is even. 


Proof. Suppose the hypotheses hold. Let 
A={d|n:d<j,ptd, B={d|n: d<j, p| dy, 


so that {d | n : d < j} is the disjoint union of A and B. It suffices to show that 
#A = #B. For each d € A consider pd. Then pd | n and pd < pj. But by our 
hypothesis, we must then have pd < j, so that pd € B. Thus, #A < #B. Now take 
d € B. We may write d = pad’, where d’ | n, p + d’, andd’ < j/p < j. Thus, d’ € A, 
which shows that #B < #A. So, #A = #B, completing the proof. 
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For real numbers x > z > y = 1, let H(x,y, z) denote the number of integers 
n < x which have a divisor in the interval (y, z]. From the main theorem in Ford [3], 
we have the following result. 


Lemma 2.2. Suppose that y > 2 and 2y < z < y’ < x. Letu = logz/ logy —1, so 
that z= yl, Then, 


A(x, y,z) X xu (log(2/u))~?/*. 


Note that [3] states this result for y => 100 and x = 100,000, but by adjusting the 
implicit constants, the result can be seen to hold in the larger range asserted. 

We now proceed to the proof of the theorem. Let 2 < k < j be a parameter 
to be chosen shortly. First note that the number of n < x for which there is no 
prime p < k with p||n is O(v/ logk). Indeed this follows from sieve methods, in 
particular [4, Theorem 2.2]. Let u = logk/logj and z = kj = j'!*“. By Lemma 2.2, 
the number of n < x which have a divisor in (j, z] is O(xu*(log(2/u))~*/*). Let us 
equate these two O-estimates so as to fix our parameter k: 


logk\° 2logj\\~ >? 
agp (log(2/u))~7/? = x eer log = . 
logk logj logk 


After a small calculation this leads to a reasonable choice for k being 


k = exp (log j)*/"* og log(2j))!5/C+9) ; 


With this choice of k we have that the number of n < x for which it is not the case 
that both 


* there is a prime p < k with p|ln, 
¢ nis free of divisors from the interval (j, kj], 


is O(x/logk). By Lemma 2.1, if both of these conditions hold, then 7;(n) is even. 
With the choice of k given just above, this completes the proof. 

It is clear that any integer n which has no prime factors in [1, j] also has t;(n) = 1; 
that is, t;() is odd. Thus, 

x 
logj 

using [4, Theorem 2.5]. 

This “trivial” lower bound can be improved using ideas similar to those used to 
prove Theorem 1.1. 


Theorem 2.3. Let c > 0 be arbitrarily small. Uniformly for j < x'/?~° and x 
sufficiently large (in terms of c), we have 


Nj(x) > 


Xx 
(log j)* (log log(2j))3/? ” 
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Proof. It follows from [3, Theorem 4] that for j < x!/?~¢, the number of integers 
n < x with exactly 1 divisor in (j/2,/] is of magnitude x(logj)~*(log log(2j)~*/. 
Further, from the comments in the first paragraph of Sect. 1.3 in [3], the same is 
true if we ask in addition that 1 is odd. For such an odd number n, it follows by an 
argument akin to that of Lemma 2.1 that 1;(2n) is odd. The claimed lower bound 
follows. 


To close the gap between this lower bound and Theorem 1.1, the following 
strategy might be tried. It follows from Lemma 2.1 and its proof that if t;(m) is 
odd and if p is the least prime with p||n, then 7 has a divisor in (j, pj] that is divisible 
by p. (It is also possible that m has no prime factor p with p||n, but such numbers are 
negligible.) Let Nj(x, p) denote the number of integers n < x such that (i) p is the 
least prime with p||n and (ii) 1 has a divisor in (j, pj] divisible by p. Again following 
the thoughts in the first paragraph of Sect. 1.3 of [3], it may be possible to show 
that for each p < exp((logj)!/”), N,(x, p) is uniformly bounded above by a constant 
times 


1 x(log py? 
p log p (log j)* (log log(2j))?/? 


(1) 


That is, a factor p log p is introduced in the denominator due to the condition that 
p is the least prime with p||n. Summing this estimate for p < exp((log/)!/”) yields 
the estimate O(x(log j)~* (log log(2j))~*/*), with larger values of p being trivially 
negligible. Thus, we would have a match with the improved lower bound, at least 
torpae 

The estimate (1) would follow if one could show that the number Ni (x, p) of 
integers m < x/p having a divisor in (j/p,j] and such that if q||m then g > p, is at 
most a constant times the expression in (1) for p < exp((log/)!/?). Multiplying 
such a number m by p would cover all those n counted by Nj(x,p), that is, 
Nj(x,p) < N;(x,p). It would seem that upper bounding Ni(x,p) in this way is 
eminently provable using the ideas in [3], since integers m with such restrictions 
on their small prime divisors seem less likely to have a divisor in a given interval 
than integers in general. 


3 A Corollary 


We saw at the start that if j is randomly chosen in [1, 7], it is more likely than not that 
t(n) is odd, despite our theorem. This is because of the huge weight of the interval 
[n/2,n). To equalize things, we might take a harmonic measure. For y € R, y > 1, 
let t,(1) = Ty (a). Let 


1 d 
S(n) = fy € [Ln] : Hn) is odd}, f(n) = ee 
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Then we always have 0 < f(n) < 1. Further, if 7 is prime, then f(n) = 1, while if 
n = 2p where p is prime, then f(7) — 0 as p — oo. We can ask what is the normal 
value of the statistic f(n). The following corollary of Theorem 1.1 addresses this 
question. 


Corollary 3.1. There is a set of integers <& of asymptotic density 1, such that if 
n— oo withn € &, then f(n) > 0. 


Proof. Since we always have f(n) € [0,1], the assertion of the corollary is 
equivalent to 


Sf) = 0a), x > 00, 


nSx 


and this in turn is equivalent to 


¥. f(r) = o0(x%), x7 00, 


ne( 5x] 


which is equivalent to 


d 
/ — o(xlogx), x2 oo. (2) 
S(n) Y 


n€(4x,x] 
For n € (5x, x], consider its divisors 1 = d, < dy < +++ < d,q@) =n, so that 


d d; 
bn 7 eZ). 
s(n) Y dj 


i<t(n) 
i odd 


The interval (d;, dj+1) has the companion interval (n/dj+1,n/d;), which is the same 
as (dz(n)—i, de(n)—i+1)- Further if n is not a square, i is odd if and only if t(n) — i is 
odd. Thus, for n not a square, 


/ dy _» / dy 
S(n) Y S(n)N[1,./71} y 


Since the squares are negligible, to prove (2), it now suffices to prove that 


d 
/ Oe e o(xlogx), x— oo. (3) 
S(n)NEL, Jn] Y 


n€(5x.4] 
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This sum is equal to 


ee 


ne(t xxx] JS <n 
tj(n) odd 


except for a possible error of o(1) as x — oo caused by j = |./n]. Ignoring this 
triviality, the sum in (3) is now equal to 


irl ais 1 1/468). 
y log 1<x 2X = Mog 2p < x(log x) 


isi ne(4x,a] 
iSJn 


t(n) odd 


using log((j+ 1)/j) < 1/j and Theorem 1.1. Since 1/(1 +6) < 1, it follows that (3) 
holds, and as we have seen, this is sufficient for the corollary. This completes the 
proof. 


4 Final Thoughts 


Though the situation is much simpler in this note, the idea behind our Lemma 2.1 
was inspired by the argument in Maier [5]. 

One might ask about other residue classes for t;(n). Our proof can show that for 
each fixed positive integer k, the set of numbers n such that k | t(n) andk + 7;(n) has 
asymptotic density o(1) as j > oo. For k not a power of 2, it might be interesting to 
investigate the density of those numbers n where k + t(n) and also k + 1;(n). 

It is interesting to see several connections of this note to work of A.S. Besi- 
covitch. First, Ford’s theorem, cited in Lemma 2.2, is the latest chapter in a long 
story that began with work of Besicovitch [1] in 1934, when he showed that 
limy+o0 H(x, y, 2y)/x has liminf 0 as y > oo. And second, this note was motivated 
originally by looking for examples for sequences that perhaps violated a result 
known as the Besicovitch pseudometric, see [2]. In particular, it was thought if the 
densities lim,—>o9 Nj(x)/x did not approach 0, then this would be a violation. It is 
interesting that we used a descendant of the 1934 Besicovitch result to show that 
these densites do approach 0 and so there is no counterexample. 
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Counting Primes in Arithmetic Progressions 


John B. Friedlander 


To Helmut Maier on the occasion of his 60th birthday 


These notes represent an expanded version of a lecture delivered at the Urbana 
meeting of June 2014 in memory of Paul and Felice Bateman and of Heini 
Halberstam, and, in modified form, at the October 2014 workshop at the Royal 
Swedish Academy of Sciences, Stockholm, on the occasion of the presentation to 
Yitang Zhang of the 2014 Rolf Schock Prize in Mathematics for his ground-breaking 
work on bounded gaps between primes. 

Both Zhang’s work [31] and the earlier work of Goldston—Pintz—Yildirim [19], 
which had inspired it, kindled novel ideas on the use of sieve methods to produce 
prime numbers in interesting integer sequences and sparked renewed interest, if 
any reminder were needed, in the central part played in such developments by the 
distribution of primes in arithmetic progressions. 

We take advantage of this renewed activity to survey here some of the few things 
we know (180 years after Dirichlet!) about that distribution and some of the things 
we expect to be true. Inter alia, we highlight a key role which has been played in 
parts of this story, based on ideas of Helmut Maier. It is a great pleasure to offer this 
work as a tribute to Helmut on the occasion of his 60th birthday. 


1 Primes in an Individual Progression 


Dirichlet [8] set the subject on its feet by proving the fundamental result: 


Theorem 1.1. Let g be a positive integer and a an integer relatively prime to q. 
Then there are infinitely many prime numbers p with 


p =a(modq). 
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Shortly after the proof of the prime number theorem, de la Vallée—Poussin 
combined his techniques with those of Dirichlet to give an asymptotic formula 
for the number of primes up to a given point, in an arithmetic progression. 
Throughout, we shall find it convenient, following Tchebyshev, to count the primes 
with logarithmic weight. Thus we define, for (a, gq) = 1,! 


0(x; q,a) = > O(n), 


nsx 
n=a( mod q) 


where 


logp if n=p is prime, 
(a) = 
(”) 0 otherwise, 


and the expected error in the asymptotic formula, 


E(x; g, a) = 0(x; q,a) — —. 
~(q) 


where, as usual, g(q) is Euler’s function, the number of reduced residue classes 
modulo q. As is to be expected, analogous statements apply throughout the paper to 
the similar prime counting functions w(x; q, a) and 2 (x; q, a). 
De la Vallée—Poussin gave the asymptotic formula 
64,4) ~~ —, (1) 
9(q) 


with an error bound (actually, quite a bit stronger than) 
E(x; q,4) Kaq aor 


for arbitrary A, where here and throughout, we abbreviate & = logx and the 
subscript indicates that the constant implied in < is allowed to depend only on 
A and q. 

The dependence on A is unimportant but, for numerous applications, it is vital to 
obtain results which make explicit the dependence on gq and, even more crucially, 
to show that the asymptotic formula (1) holds already with q not terribly small 
compared to x, and the larger the better. The best we know today is still the Siegel— 
Walfisz theorem [30, Hilfsatz 3], 


EGG a) <a ne (2) 


'This co-primality restriction is assumed throughout, though not always explicitly mentioned. 
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which looks the same as the previous bound until you notice that the dependence on 
q has been removed. This gives the asymptotic formula as long as g > oo no faster 
than a fixed power of log x. Although of fundamental value in many applications, 
this still leaves g too small for the result to be of use in a great many others. 

However, we expect much more to be true. It follows from the generalized 
Riemann hypothesis (GRH), assumed for those Dirichlet Z-functions formed with 
the characters of modulus gq, that 


E(x;q,a) KX 2.2L", 


which implies the asymptotic formula in the range, say g < 23, 
Being more optimistic, by a conjecture of H. Montgomery which is widely 


believed, we expect 
x £ 
E(x; q,4) Ke ,J/—-x’, 
q 


which would give the asymptotic in the range q < x!~> for arbitrarily small 
positive 6. 

On the other hand, one can go too far with such statements. Montgomery [24, 25] 
actually suggested the conjecture in the stronger form 


x 4. 
E(x; q,a) Ke (=)’ Ze 
q 


which would imply the asymptotic formula, for example, in the range g < x.2~°. 
This, however, cannot hold, even if —3 is replaced by any other number. Thus, as 
a special case of the results in [15], one knows that the asymptotic formula 


A(x; q, 2) ~ a 


fails for many odd q in the range q < Q with, just for example, Q = x@~70.000.000, 


This result, as well as earlier ones [14, 17] in which the residue class a was required 
to grow with x, depends on a simple but beautiful idea [22] of H. Maier. We give a 
brief description of this in Sect. 4. 


2 Approaching Twin Primes with the Sieve 


Perhaps there is no chance to guess who might have been the first to make the twin 
prime conjecture: 
There exist infinitely many primes p such that p — 2 is also prime. 
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It is natural to try to attack this problem using sieve methods. As a first try, 
consider the sequence of numbers 


i 1 if n= m(m-—2) for some m, 
" 0 otherwise. 


We would like to study the sum 


SQ) = Do ai: 


n&x 


where 


x= 1 if (#,P)=1, 
""" 10 — otherwise. 
with P = II, <-P, hopefully with say z > 2x4 , which would imply that both m and 
m — 2 are primes. 
If we try the ancient sieve of Eratosthenes, we write 


Sx) = Yo an Yo u@ (3) 


q|n 


with ~ being the Mobius function. Following an interchange of the order of 
summation and an approximation of the resulting inner sum, our estimations founder 
on the fact that the Mobius function has too large a support, so that the variable g 
can range right up to x and, as a result, the errors in the approximation accumulate 
beyond control. 

Brun got the idea to replace the Mobius function j(g) by another function 1,, 
similar in many respects but having support restricted, say to g < Q. This lost for 
him the possibility of obtaining an asymptotic formula for the sum S(x), but the 
choice of possible sequences 1, was sufficiently rich that he could find candidates 
which gave useful upper (and for z not too large, lower) bounds. 

Proceeding, as with (3), we have 


Via > A= YK a An - 


q|n q<Q 


nax 
n=0( mod q) 
Here, the inner sum counts integers m for which 
m(m — 2) = O(modq) . (4) 


We can count these quite accurately for fairly large q (the larger, the better). As a 
result Brun [5] obtained, in particular, the following result. 
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Theorem 2.2. There exists an integer K such that, for infinitely many m, each of m 
and m — 2 has no more than K prime factors, multiplicity included. 


Indeed, Brun’s method produces many such m; see, for example, Chap. 6 of [18] for 
some explicit bounds. 
Now, let’s consider a second approach. We define a new sequence a, by 


Ae logp if n=p-—2 for some prime p < x, 
"~~ 10 otherwise. 


This has a major advantage in that we are beginning with the knowledge that at least 
one of the two integers is a prime. Proceeding as before, we have 


a Ae = ag > Topps (5) 


q|n q<Q pw 
p=2( mod q) 


Now, we can see a major disadvantage to this approach. It is far more difficult to 
estimate the inner sum which, in our earlier notation, is none other than 0(x; g, 2). 

There is yet a third possibility for the use of sieve methods in approaching the 
twin prime goal. Instead of insisting on integers differing by two and attempting 
to minimize the number of their prime factors, one may begin with more general 
clusters of integers potentially containing two primes and then seek to minimize 
the diameter of those that are found to succeed. Until the work [19], such an idea 
produced results which seemed too far from the goal to be a viable competitor 
in the approach to twin primes. That has now greatly changed. As with the 
second arrangement, this newer work rests heavily on the distribution of primes 
in arithmetic progressions and, in particular, on bounds of the type we are about to 
discuss. 


3 Primes in an Arithmetic Progression, on Average 


If, as before, we approximate 6(x; q, 2) by x/y(q) and if we assume that the “sieve 
weights” satisfy |A,| < 1 for all g, the effect of which turns out to be harmless, then 
we see that the error in the inner sum in (5) satisfies the bound 


error < ) > |E(x: 4, 2)] . (6) 
q<Q 


Now, we are back to the problem of counting primes in arithmetic progressions to 
large moduli. But we also see that, for the present purpose, we don’t need to know 
about primes in individual progressions, just on average over the moduli g < Q. We 
still want to be able to take the range of the moduli, in other words Q, as large as 
possible. 
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Actually, rather than (6), we really need only to bound the sum ys #20 Xa E(x; q, 2) 
which could be smaller, since, like j1(q), the A, can be expected to change sign quite 
a bit. For either of these sums, it would be enough to get a bound < x¥Y~? for 
this application and a bound <4, x4 would handle almost any application to 
problems which have been found to be approachable to date. 

The first bound of this type is due to Renyi [27] who showed, with particular 
reference to the twin prime problem, that 


S PGi. Di<uae (7) 
qeQ 


for a certain complicated looking but usable subset @ (related to the support of Brun 
sieve weights) of the integers {1 < g < Q} with Q = x3-®, Asa result, he became 
the first to derive the following theorem. 


Theorem 3.3. There exists an integer K such that, for infinitely many primes p, 
p —2has no more than K prime factors, multiplicity included. 


Some fifteen years later, Barban [1], using a theorem of Linnik, improved Renyi’s 
bounds of type (7) in a number of ways, in particular, extending the level to size 
Q= xE7e, Combining this with work of B. Levin, he thus achieved K = 4 as 
an acceptable choice. In 1965, Buchstab [6] used this result of Barban in more 
advantageous fashion to prove Renyi’s theorem with the value of K = 3. Finally, 
J.-R. Chen announced in 1965, with detailed proofs [7] appearing in 1973, the 
lowering of the admissible value to K = 2. 

By the time of Chen’s work, he could take advantage of the following fundamen- 
tal result, which in particular improves the Renyi and Barban exponents. 


Bombieri-Vinogradov theorem. For arbitrary A, the bound 


Y> max |E(x:9,4)| Kae x0 (8) 
<0 (a,q)=1 


holds with Q = x?~°. 


In fact, Bombieri [2] gave this in the stronger form with Q = x2.L—8 with some 
B= B(A). 

The B—V theorem has been the source of a multitude of applications. Note that 
it confirms in a certain average sense the “level of distribution” x2 which, as 
mentioned in Sect. 1, would follow from the GRH. The theorem also quickly gave 
rise to conjectures corresponding to (and in fact pre-dating) the stronger conjectures 
of Montgomery. 


—é 


Elliott-Halberstam conjecture. The bound (8) holds with Q = x!'~*. 
Actually, here too the bound was originally conjectured to hold with a level of 
the type O = xY~*; see [14] or, better still, the original work [9]. 
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In any case, the latter version was disproved in [14, 17], again based on Maier’s 
idea, whereas the former has remained widely believed and has proved to be 
important, spawning a large number of consequences. One of the most striking is its 
implication in [19] that lim inf{p,+1 — pa} < 16 (since improved to 12 by Maynard 
[23]). 

The fact that these conjectures of Montgomery and of Elliott and Halberstam 
seem reasonable for moduli of size x'~*, but are demonstrably wrong for moduli of 
size x Z~“, leads naturally to the following question. Quite apart from being able to 
prove it, 


What do we think might be the truth? 
The authors in [17] made the following 


Hypothesis. Let ¢ > 0. For all g > qo(e), (a,q) = 1, and x > q(logq)'T°, we have 


x \ 1-(—e)5(x.q) 
E(x; q,a) « (=) ; (9) 
q 
where 
1 log(log ae log, x) 
5(x,q) = min \ (10) 
2 log, x 


Here, log, stands for log log. 


An analogous hypothesis is made about the sum )> a<0 |E(x; q, a)|. 
The expression for 6 in (10) is a little complicated to digest so we illustrate it in 
two important special cases: 


(I) One expects the asymptotic formula 0(x;q,a) ~ x/@(q) to hold so long as 
q <x for any A(x) > 00 asx > 0. 

(II) We expect the asymptotic formula to hold with square-root cancellation in the 
error, that is E(x; q,a) < (x/q)2*8, provided that q < x exp(—V_Y). 


In [17] it is shown that the above hypothesis is false if one replaces 1—e by 1+¢ 
in (9). Perhaps it is dangerous to assume a result is best possible simply because one 
does not know how to improve it! 


4 Maier’s Idea 


In the context of studying primes in arithmetic progressions, the Maier approach to 
irregularity results amounts to a consideration of the double sum 


>> 6 a(rP + sq) (11) 
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as r runs through the integers in a dyadic interval R/2 < r < R ands runs through 
the integers 1 < s < S. Here, P is a parameter to be chosen and, for given r, the 
inner sum counts, with our usual logarithmic weights, the primes in a progression 
modulo g. Then summing over r, we obtain a counting over many progressions 
modulo g. Now, if for example g is a prime not dividing P or any r, then all of the 
residue classes modulo q are reduced classes. If that prime is large and if we assume 
a conjectured result for primes in progressions to such large moduli, then we can 
evaluate this total counting. 

Next, consider what happens if we interchange the order of summation in (11), 
getting 


>> > OP + sq). (12) 


Here, the inner sum is over primes in (a dyadic segment of) an arithmetic 
progression to modulus P. If the modulus P is not too large, then we know how to 
evaluate that sum and then to sum that over s. But the latter sum will involve a count, 
as s ranges, of the number of reduced residue classes modulo P. Now, however, we 
choose our parameter P (of convenient size, essentially RP = Sq and) of the form 


P=||p. (13) 
Pp<z 


The number of reduced classes modulo P has, depending on the size of z relative 
to S, a well-known oscillatory behaviour, governed by the “Buchstab” function. By 
making a choice of the parameter z so that the number of classes is relatively large 
(respectively small) compared to its mean value, we produce a rigourous evaluation 
of (12) which gives a count too large (respectively too small) in comparison with 
the count conjectured for the same double sum in the form (11). 

This explanation might lead one to ask what happens if the moduli g are much 
smaller, why doesn’t the method still work? Actually, it does work but the results it 
then contradicts are estimations so accurate that nobody would have believed them 
anyway and indeed some can be more directly seen to be false. 


5 Bilinear Forms and the BFI Conjectures 


Using elementary ideas of Tchebyshev [28], it is possible to replace sums over prime 
numbers by more flexible bilinear forms. Begun in the work of I. Vinogradov [29] 
on the ternary Goldbach problem and then fashioned into a versatile tool by Linnik 
[21] in his development of the dispersion method, these ideas have since engendered 
quite a variety of useful elementary identities for the von-Mangoldt function (see 
Chap. 17 of [18] for a representative sampling). 
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What these identities have in common is that they replace 6(k) (or A(k) or 2(k)) 
by the sum of a few weighted divisor functions 


>> cub, 


mn=k 


so that we can make a transformation of sums 


> IEG: q.a)| — sums of type Y> |Aaxp (x: q.a)|, 
q q 


where * denotes Dirichlet convolution and 


1 
A(fsaa= >> fm-—~ Do fe. 
nSx 9(4) n<x 
n=a( mod q) (n,q)=1 


In practice, the sequences a,,, 6, can be taken to be bounded by some power 
of the divisor function t(n) and, after a further subdivision, supported in boxes 
M/2<m<M,N/2 <n< N. The only arithmetic information required is that one 
of them, say f,,, satisfy a Siegel-Walfisz axiom, that is 


Ag(N; 4,4) <a ||B||WN(ogN) , (14) 


where ||... || is the £2-norm. 

One main advantage of a decomposition of this type is that it breaks a sum over 
primes into pieces which might be estimated by different methods, no one of which 
is suitable for all of them. As such, the method of bilinear forms has quite general 
capabilities. Herein, we are specifically interested in its use in proving bounds of 
B-V type. In order to prove such a theorem with Q = x" * for some 7 > s it is 
sufficient to prove the same result for all relevant bilinear forms. 

Thus, in [3] a number of such conjectures were proposed: 


Conjecture I. Let € > 0. Let am, By be sequences as above with M = x5 N = xb 
where ¢ < ¢ < | —e. Then, for arbitrary A we have 


YS Aang (9.0) Kaa |loe||[Bl| WMN-2~ (15) 


q<Q 
(g,a)=1 


with O = xY~* and some B = B(A). 


Actually, this is a special case of the conjecture, which was phrased a little more 
generally, in terms of £,-norms. 

Some up-dating remarks about this are in order. Although Conjecture I has not 
yet been disproven, following the work described in the previous section, it would be 
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advisable to postulate the safer level OQ = x'—®, Once one does this, it now becomes 
reasonable to also postulate the independence of the implied constant on a (which, 
at the time, was irrelevant to the work in [3]). This gives 


Conjecture II. In the notation of the previous conjecture, we have 


max |Aaxp (x; 4,4)! Ka,¢ [lol BIW MN2™ , (16) 


eT (a.g= 


with Q = x!*. 
Also in [3], a still weaker conjecture was proposed. 
Conjecture III. Same as Conjecture I, but with Q = xe, 


Here, the point was that this weaker conjecture (after a use of Cauchy’s 
inequality, which set this lower limit) had become something that was actually 
provable, modulo assumption of the expected bounds for exponential sums over 
certain rational functions in several variables. 

It should be mentioned that, in very recent work, D. Polymath [26] has revived 
the use of conjectures very similar to Conjectures I and II, to which they refer 
as Generalized Elliott-Halberstam conjectures, and from which they derive the 
exciting consequence that liminf{p,+1 — px} < 6. Thus, sharpening the earlier 
conditional bounds of GPY and of Maynard, they show that, not only do such 
conjectures, as has been known for quite a while, imply the Elliott-Halberstam 
conjecture, but apparently they can imply important consequences stronger than 
those implied by the Elliott-Halberstam conjecture on its own. 


6 B-V Type Theorems with Special Weights 


We have seen that, in applications, it is frequently unnecessary to have bounds for 
the sum > , E(x: q, a)| but rather for weighted sums 


~ AgE(x: g, 4) (17) 


q<Q 
(.a=1 


where the coefficients 1, may be more flexible than those given by the absolute 
value signs. 

For example, if we have A = y * 6 with y, supported on R/2 < r < R and 6, 
supported on S/2 < s < S with RS = Q (or if it can be written as a sum of not too 
many of these convolutions), then we are in a position to estimate along the lines of 
the previous section. 
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Already in the work of Renyi, the relevant sum was over a subset 2 of moduli. 
Following the proof of the Bombieri—Vinogradov theorem, the search for levels of 
Q beyond the square-root barrier returned the focus to the use of flexible weighted 
sums, in particular the well-factorable weights of Iwaniec [20], of great importance 
for their use in applications of the linear sieve. For these weights, following an initial 
breakthrough (beyond the Riemann Hypothesis) in [13], the level Q = x77~® was 
reached in [3]. For the problem at hand, this permits an upper bound for the number 
of twin primes up to a given point, one which sharpens by a factor 7/8, the bounds 
which, following Selberg’s work, had been achievable, at first only on assumption of 
GRH, and later unconditionally, after the availability of the Bombieri—Vinogradov 
theorem. 

The Iwaniec well-factorable weights are a little complicated to be briefly 
described. We give here three further examples of bounds for sums (17) having 
weights of greater or lesser flexibility. 


Example I. Although it is the most recent, we mention as our first example that of 
Yitang Zhang [31], one which is extremely flexible. We take, for z not too large, the 
very smooth numbers 


2Q=(q<Aplasp<3 
and define 


4 —$sen (E(x;q.4)) if qe Q 
710 otherwise. 


For this choice of A,, Zhang was able to prove the bound 


Do reE (a g.a) = YO (ECG. )| Ka x2, 
q<Q qe2Q 


uniformly in a = a(q) of special arithmetic type and with Q = x27" fora specific 
positive 7. Because of this uniformity in a (any such 7 would have sufficed), he 
was able to derive a spectacular consequence, the first unconditional proof that 
lim inf{Pr+1 — Pn} < 00. 


In the next example, we shall take a to be fixed. In other words, we allow the 
implied constants to depend on the residue class a. 


Example IT. Itis natural to ask what level Q one can reach if one asks, in contrast to 
the previous example, for very inflexible weights. If no flexibility at all is allowed, 
such as with absolute values, then we still cannot take a fixed exponent 7 > 0 
although, if a dependence on a is allowed, then [4] comes arbitrarily close. 


As an example having some, but minimal, flexibility, we have the following 
special case of Theorem 8 in [3]. 
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Theorem 6.4. Leta #0,A>0,¢>0,0 = 5+ 24-e. Then, 


: —A 

» ) E(x; pip2,4) Kase XL 

‘i é 
pi<xi3? po<xi3° 

where, as usual, p\, pz run over primes. 


Actually, the result holds uniformly in a up to a small power of x, depending on ¢. 


Example II. Our final example is the simplest that one could think of; we take A, 
to be identically equal to 1. This example, which has application to the Titchmarsh 
divisor problem, was independently studied in [3, 12]. 

What is of particular interest here is that we can successfully treat the sum 
> q<o E(%: g, 4) with O = x!~®, due to the Dirichlet switching principle. In other 
words, if we consider the equation 


then the roles of g and r are symmetric and at least one of the two is < x2. Asa 
result, as soon as you can push a little beyond the Bombieri—Vinogradov level, you 
get to go quite a bit further. 


The fact that we can succeed with such large g means that we can rigourously 
investigate the limit in the size of the level Q for which a bound 


tae Sal (18) 


q<Q 
(q,a)=1 


can hold. We find that the answer depends on the residue class a, not so much its 
size but rather its arithmetic structure. Thus, we have the following result of [16] 
(which heavily uses [4]). 


Theorem 6.5. Fix A > 1, M > 1, e > 0. For some B depending on A and M and 
for all large x, the bound (18) holds for every Q < x£~* and every 0 < |a| < x 


satisfying 


aa < Mlog, x 


pla 


but, for every B and all large x, the bound (18) fails for some Q < x~® and some 
a, 0 < jal < x, satisfying 


pe < (log, x)ste : 


pla 


If the Riemann Hypothesis is assumed then, in the last exponent, © can be replaced 
by 1. 
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Thus, under RH, the result is tight apart from the epsilon. By taking an M > 1, we 
see that it is the first case that applies to a typical residue class a. 


Finally, it should be mentioned that, in connection with this last example, there 


are interesting further developments in recent work [10, 11] of Fiorilli. 


Acknowledgements The author’s research is partially supported by a University Professor Grant 
from the University of Toronto and by the Natural Sciences and Engineering Research Council of 
Canada through Research Grant A5123. 


References 


1. 


2 
3. 


4. 


18. 


19. 


M.B. Barban, The “density” of the zeros of Dirichlet L-series and the problem of the sum of 
primes and “near primes”. Mat. Sb. (N.S.) 61, 418-425 (1963) 

E. Bombieri, On the large sieve. Mathematika 12, 201-225 (1965) 

E. Bombieri, J.B. Friedlander, H. Iwaniec, Primes in arithmetic progressions to large moduli. 
Acta Math. 156, 203-251 (1986) 

E. Bombieri, J. B. Friedlander and H. Iwaniec, Primes in arithmetic progressions to large 
moduli HI. J. Amer. Math. Soc. 2, 215-224 (1989) 


. V. Brun, Uber das goldbachsche Gesetz und die Anzahl der Primzahlpaare. Arch. Math. 


Naturvid. B34(8), 1-19 (1915) 


. A.A. Buchstab, New results in the investigation of the Goldbach-Euler problem and the 


problem of prime pairs. Dokl. Akad. Nauk SSSR 162, 735-738 (1965). (Soviet Math. Dokl. 6, 
729-732 (1965)) 


. J.R. Chen, On the representation of a larger even integer as the sum of a prime and the product 


of at most two primes. Sci. Sinica 16, 157—176 (1973) 


. G.L. Dirichlet, Beweiss des Satzes das jede arithmetische Progression deren erstes Glied 


und Differenz ganze Zahlen ohne gemeinschaftlichen Factor sind, unendlich viele Primzahlen 
enthalt, Werke pp. 313-342, Kon. Preuss. Akad. der Wissen., Berlin, 1889. Reprinted Chelsea, 
New York, 1969 


. P.D.T.A. Elliott, H. Halberstam, A conjecture in prime number theory, in Symposia Mathemat- 


ica, vol. T1V INDAM, Rome, 1968/69) (Academic Press, London, 1970), pp. 59-72 


. D. Fiorilli, Residue classes containing an unexpected number of primes. Duke Math. J. 161, 


2923-2943 (2012) 


. D. Fiorilli, The influence of the first term of an arithmetic progression. Proc. Lond. Math. Soc. 


106, 819-858 (2013) 


. E. Fouvry, Sur le probléme des diviseurs de Titchmarsh. J. Reine Angew. Math. 357, 51-76 


(1985) 


. E. Fouvry, H. Iwaniec, Primes in arithmetic progressions. Acta Arith. 42, 197-218 (1983) 
. J.B. Friedlander, A. Granville, Limitations to the equi-distribution of primes I. Ann. Math. (2) 


129, 363-382 (1989) 


. J.B. Friedlander, A. Granville, Limitations to the equi-distribution of primes III. Compos. 


Math. 81, 19-32 (1992) 


. J.B. Friedlander, A. Granville, Relevance of the residue class to the abundance of primes, in 


Proceedings of the Amalfi Conference (Maiori, 1989) (Universita di Salerno, Salerno, 1992), 
pp. 95-103 


. J.B. Friedlander, A. Granville, A. Hildebrand, H. Maier, Oscillation theorems for primes in 


arithmetic progressions and for sifting functions. J. Amer. Math. Soc. 4, 25-86 (1991) 

J.B. Friedlander, H. Iwaniec, Opera de cribro, in American Mathematical Society Colloquium 
Publications, vol. 57 (American Mathematical Society, Providence, RI, 2010) 

D.A. Goldston, J. Pintz, C.Y. Yildirim, Primes in tuples I. Ann. Math. 170(2), 819-862 (2009) 


114 J.B. Friedlander 


20. 
21; 


22. 
23. 
24. 


25), 


26. 


27. 


28. 


29. 
30. 
31 


H. Iwaniec, A new form of the error term in the linear sieve. Acta Arith. 37, 307-320 (1980) 
Yu.V. Linnik, The Dispersion Method in Binary Additive Problems (American Mathematical 
Society, Providence, RI, 1963) 

H. Maier, Primes in short intervals. Michigan Math. J. 32, 221-225 (1985) 

J. Maynard, Small gaps between primes. Ann. Math. 181(2), 383-413 (2015) 

H. Montgomery, in Topics in Multiplicative Number Theory. Lecture Notes in Mathematics, 
vol. 227 (Springer, Berlin/New York, 1971) 

H. Montgomery, Problems concerning prime numbers. in Mathematical Developments Arising 
from the Hilbert Problems, Northern Illinois University, De Kalb Illinois, 1974. Proceedings of 
Symposia in Pure Mathematics, vol. XXVIII, pp. 307-310 (American Mathematical Society, 
Providence, RI, 1976) 

D.H.J. Polymath, Variants of the Selberg sieve, and bounded intervals containing many primes. 
Res. Math. Sci. (2014) (to appear) 

A. Renyi, On the representation of an even number as the sum of a single prime and single 
almost-prime number. Izvestiya Akad. Nauk SSSR. Ser. Mat. 12 (1948) 57-78. (Amer. Math. 
Soc. Transl. 19(2), 299-321 (1962) 

P.L. Tchebyshev, Sur la function qui détermine la totalité des nombres prémiers inferieurs a 
une limite donnée. J. Math. Pures et Appl. 17, 366-390 (1852) 

I.M. Vinogradov, Selected Works (Springer, Berlin, 1985) 

A. Walfisz, Zur additiven Zahlentheorie II. Math. Zeit. 40, 592-607 (1936) 


. Y. Zhang, Bounded gaps between primes. Ann. Math. (2) 179, 1121-1174 (2014) 


Limit Points of the Sequence of Normalized 
Differences Between Consecutive 
Prime Numbers 


Daniel A. Goldston and Andrew H. Ledoan 


Dedicated to Helmut Maier on the occasion of his 60th birthday 


Abstract Let p, denote the nth prime number and let d, = py+1—py denote the nth 
difference in the sequence of prime numbers. Erd6s and Ricci independently proved 
that the set of limit points of ae the normalized differences between consecutive 
prime numbers, forms a set of positive Lebesgue measure. Hildebrand and Maier 
answered a question of Erdés and proved that the Lebesgue measure of the set of 
limit points of ae in the interval [0,7] is >> T as T > oo. Currently, the only 
specific limit points known are 0 and oo. In this note, we use the method of Erdés to 
obtain specific intervals within which a positive Lebesgue measure of limit points 
exist. For example, the intervals [z 2] and [a- 1] both have a positive Lebesgue 
measure of limit points. 


Keywords Hardy—Littlewood prime k-tuple conjecture * Limit points of normal- 
ized differences between consecutive prime numbers * Singular series 


1 Introduction 


In this note, we are concerned with the limit points of the sequence of normalized 
differences between consecutive prime numbers. Let p, denote the nth prime 
number and let d, = py+1 — Pn denote the nth difference in the sequence of prime 
numbers. By the prime number theorem, the average size of the differences d,, is 
log p,, and therefore it is interesting to examine the distribution of the limit points 
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of the normalized differences a Since log p, ~ logn as n — oo, one can equally 


well consider the sequence oS which has the same set of limit points.! 

It is difficult to find specific limit points for the sequence of normalized prime 
differences. In 1931 Westzynthius [12] proved that oo is a limit point, and it was 
only in 2005 that Goldston et al. [6] proved that 0 is also a limit point. We expect 
that every non-negative real number is a limit point, and this is a consequence of the 
Hardy—Littlewood prime tuple conjectures. 

We are concerned here with a result of Erdés [5] and Ricci [10] who, in 1954, 
independently proved that the set of limit points of we has positive Lebesgue 
measure. As we will see, the limit points obtained by this method are not large. For 
example, no finite limit point larger than 1 can be proven to exist by this method, 
and the largest Lebesgue measure for the set of limit points that one can ever hope to 
prove by this method is certainly less than 1. It was, therefore, a great breakthrough 
when in 1988 Hildebrand and Maier [8] proved, among other more general results, 
that the Lebesgue measure of the set of limit points of me in the interval [0, T] is 


>> T as T > oo. Their result implies that the erehee me has arbitrarily large 
finite limit points, which more than settled a question of Erdés to prove that there is 
at least one finite limit point greater than 1. 

In this note, we use the method of Erd6s to obtain specific intervals in which there 
must be a positive Lebesgue measure of limit points. A representative example of 
our results is the following Corollary of our Theorem. 


Corollary. The intervals [3 72 and [a 453 1] each contain a positive Lesbegue 


measure of limit points of the sequence jon’ 


2 Erdés’s Method 


The only information required concerning prime numbers in Erd6és’s method is the 
prime number theorem and a sieve upper bound for pairs of prime numbers. The 
prime number theorem states that 


U(x) := yoi~ cae as Xx > ©. 


PnSx 


We will use the following two relationships that follow immediately from this: 


a es (1) 
log x 


X<pn<2x 


'Erd6s proved his result with this latter sequence, while Ricci used the former sequence. 
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and 


> dy ~X, aSx—> oo. (2) 


X<pyn<2x 


Let p denote a prime number. The singular series G(k) is defined for all integers 
k #£Oby 


-1 
2C> I] (255) , if k is even; 


G(k) = plk (3) 
>2 
0, if k is odd; 
where 
=|] ia—" — | =9e0is (4) 
2= (p _ 1p =U. weed 
p>2 
Then the upper bound estimate on prime pairs that we need is 
x 
Y> 1<(€+096H—,,, (5) 
(log x)? 
x<p<2x 
p+k prime 


which holds for any arbitrarily small positive constant € and all sufficiently large x. 
Here, @ is a constant. 

The value @ = 4 in (5) is due to Bombieri and Davenport [2] (See also [7]). If 
k < (logx)® for any given B, Bombieri et al. [3] obtained @ = q Their result is 
only stated for the twin primes, but the above generalization holds. The best current 
results are due to Wu [13], who obtained the constant @ = 3.91045 uniformly for 
all k, and @ = 3.3996 if k < (log x)? for any given B. 

We will obtain our results from estimates for the consecutive prime gap counting 
function for prime numbers in the interval (x, 2x] defined by 


GM) =Gix,M)= >> 1. 
X<pn<2x 
<M 


In 
logpn — 


-1 
G*(M) := nmr (i) G(x, ) 
X00 logx 


From (5), we prove the following bound on normalized consecutive prime differ- 
ences in an interval. 


Further, we define 
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Lemma 2.1. For fixed numbers 0 < a < b, and © a real number for which (5) 
holds, we have that 


Gb)-G@= YD 1<(@+0b-a— (6) 
; log x 
X<py<2x 
4<fogpn <b 


holds for any arbitrarily small positive constant € and all sufficiently large x. 


Using Lemma 2.1 and a standard analysis argument, we obtain the following 
lower bound on the measure of the set of limit points. 


Lemma 2.2 (Erdds [5]). Let 


S(M) = dy : dn <M 
7 log Pn log Pn ~ 


and let 
S*(M) = {c:c is a limit point of S(M)}. 
Then we have 
m(S*(M)) > ZG" (M), 


where m(E) denotes the Lebesgue measure of the set E. 


We now need to find lower bounds for G*(M). Erdés found two methods for 


doing this. The simple method he used in [5] obtains a non-trivial lower bound for 


M > 1, while the method in [4] obtains a non-trivial bound for M > 1 — xe: 


Lemma 2.3. Fix a real number M > 1. Then we have 
1 
G*(M) >1-——. 7 
(M) = rv; (7) 


IfM>1- aa then we have 


+5 


*(M) > 1-—=——.. 
oO. GM+1 


(8) 
The first lower bound (7) is better than (8) when M > 2. Combining Lemmas 2.2 

and 2.3, we obtain the following Theorem. 

Theorem. Let a be any real number for which a > @. Then each of the intervals 


E (1-5) |. for any M > 2, (9) 


a 
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and 


oa fi j--22M <2 (10) 
-f1—- ; , forany 1—- — : 
a aM +1 y 36 


. ease ook : d 
contains a positive Lesbegue measure of limit points of the sequence ipa 


Proof of the Theorem. Combining Lemma 2.2 and (7), we have 


n(vwoo[E(--2) ») =mtro-m(lo2(+-3)) 
=(E-D(-s)> 


which proves that (9) contains a positive Lebesgue measure of limit points. For (10), 
since 
u+ 5 
uM + 1 


is an increasing function of u for u > 0 when M < 2, we have for any a > @ and 
1-55 <M <2 that 


a+% 
aM +1 


G*(M) > 1—- 


The same argument just used for (9) with this lower bound for G* shows (10) has a 
positive Lebesgue measure of limit points. Oo 


The Corollary mentioned in the previous section is obtained by taking the value 
a = 4 which is valid since we can use a value of @ < 4 and take M = 2 in (9) and 
M = 1 in (10). 


3 Proof of the Lemmas 


In this section, the letter € will denote a small positive number which may change 
each time it occurs. 


Proof of Lemma 2.1. We make use of the sieve bound (5) together with the singular 
series average 


T(N) := )°G6(k)~N, asN > oo. (11) 


k<N 
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(See [2].) Let p’ denote, here and henceforth, a prime number that is not necessarily 
adjacent to the prime number p. Then we have 


yi a 


X<ppS2x x<p<2x 
dn <<p’—p< 
aS ior bn <b alogx<p'—p<b log 2x 


= ae 


alogx<k<blog2x x<p<2x 
p+k prime 


x 


alogx<k<b log 2x 


<(@+ (ba) 


ZX 
This completes the proof of Lemma 2.1. oO 


Proof of Lemma 2.2. Since S*(M) is a set of limit points in the interval [0, M], it is 
a closed and bounded set of real numbers, and is therefore compact. We construct a 
sequence of open covers of S*(M), the kth one being defined by 


CO. = yt — (c-Ze+ z)ice stan} F 


Letting 


a= (J LW, 


ceS*(M) 


we have that 
lo. ) 
S*(M) = ( \ Ox. 
k=1 


Since {Q;}72, is an infinite decreasing sequence of measurable sets, i.e., a sequence 
with Q;4;, C Q; for each k, and since m(Q,) < M + 2, we have 


m(S*(M)) = m (A a, = Jim m@Qv), 
k=1 


by a standard property of Lebesgue measure. (See [11], Proposition 14, pp. 62-63.) 
Thus, given any € > 0, we can find a sufficiently large number kp such that 


m(S*(M)) = m(Qio) — €. (12) 
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By the Heine—Borel theorem, the open covering ©, of S*(M) has a finite subcover- 
ing {I,, (ko),...,Ic,(ko)} with c; € S*, 1 <i<r, and 


S*(M) CJ kei(ko) C Oro. 


i=1 


The open intervals /,,(ko), with i = 1,...,7, may not be disjoint from each other. 
However, since the union of two overlapping open intervals is itself an open interval 
containing both of the original intervals, we may combine overlapping open 
intervals to obtain a smaller number of disjoint open intervals with the same union 
as the original intervals. As a result, we have obtained a finite collection {1;,...,J;}, 
with s < r, of disjoint open intervals such that 


S*(M) CJC OQ. 


i=1 


Letting £(/) denote the length of the interval J, we have by (12) that 


m(S*(M)) =m (U 1) -e€= So edi) —€. 


i=1 i=1 


By Lemma 2.1, 


-1 
a= (era) Yn. 


X<pn<2x 
dn é 
log pn eli 
dn 


7 lies in an interval J; for some 
Og Pn 


and since for all sufficiently large n each number 
i, we have 


-1 
m(S*(M)) > («+ on) G(,M) -«, 


for x sufficiently large. Since the left-hand side of this inequality does not depend 
on x, we take the lim inf of the right-hand side and obtain Lemma 2.2. Oo 


Proof of Lemma 2.3. Since 


dy, 
MN) » 12 3 (1-74) 


X<pn<2x X<pnS2x 
mu <M dn<M log x 
log Pn (13) 
1 
> Mlogx—d,) =: H(M), 
rior > (Mlogx = dn) =: H(M) 
X<pn<2x 


dn <M log x 
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by (1) and (2) we have 


1 


H(M 
0 Mlogx 


> (M log x ~~ dn) 


X<pn<2x 


1 
> Ease 2 dn (14) 


X<Pyp<2x X<py<2x 


1 1 x 
(Gaol 

x =i 1 
(ic) Gu = (1~5,-¢), 


for x sufficiently large, and (7) follows. 

We can improve on this argument when M is close to 1 by using the sieve 
bound (5) which prevents the lengths of all of the consecutive prime differences 
from being concentrated in a very small interval. 

Letting A > 1 be a fixed number that will be chosen later in terms of M and @, 
we have by (14) that 


II 


IV 


Hence 


for x sufficiently large. Now taking M < 1, we have 


H(A) = Ton > (A logx — d,) + > (A log x — dy) 


Al 
x<pnS2x X<pnS2x 
dn<M log x M log x<dy<A logx 
p=pP 
< G(M) + ) (1 ae ) 
ogx 
x<p<2x & 


Mlogx<p’—p<A log x 


= G(M) + H(A, M). 


We will show below that 


j= 2 
Hy(A,M) <(6 +) 5 (=). 
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and therefore on combining these last three equations we have 


x \7 1 (A — My? 


1 € 5 
> 5 (4-1- Fa-my)-« 


The choice A = M + z maximizes the expression in the parentheses, and with this 
choice we obtain 


+35 


-1 
x 

|) G@p2i=- 2 =< 
(<) (M)21- oaai< 


and (8) follows. 
To prove the bound for H, we see that the same calculation used in the proof of 
Lemma 2.1 gives 


MAM) <(@+9e—, (1- ) ew 


log x)? 
( 8 ) M log x<k<A logx 
Xx 


= (@ + €)T,(A log x, M log x) ——., 
(log x)? 


where 


nwwy= (1-Z)ew, 


V<k<Ww 


and V = V(x), W = W(x), and W —> oo as x > oo. By partial summation, we have 
using (11) that 


T\(W, V) = [ (1 = 7) dT(u) 


II 


w 
af (W —u)du+ 0(W) 


1 2 
ay — VP + 0M), 


and the bound for H, follows. This finishes the proof of Lemma 2.3. oO 
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4 Further Results and Comments 


Recently, Pintz [9] proved that there is an ineffective small positive constant c for 
which the interval [0, c] consists entirely of the limit points of oH. His proof makes 
use of Zhang’s recent breakthrough [14] on the existence of bounded differences 
between prime numbers. Banks et al. [1] have used the even more recent methods of 
Maynard and Tao to prove that for any set of real numbers 0 < A, < Aq <<... < Ax 
at least one of the differences A; — A;, with 1 < i <j < k, is a limit point of ae 
provided k > 9. This implies far stronger results than anything in this paper. In 
particular, they obtain that at least 12.5 % of all the positive real numbers are limit 
points of the sequence of normalized consecutive prime differences. With regard to 
Erd6s’s question of limit points greater than 1, they obtain that such limit points 
must exist in the interval [1,8 + 7], for any 7 > 0. 

It has been pointed out to us by Julian Ziegler Hunts that one can generalize the 
results of the current paper to examine limit points of “+*"", in which case the 
intervals obtained should be unattainable by the method of Banks et al. when r is 


large enough. 
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Spirals of the Zeta Function I 


Steven M. Gonek and Hugh L. Montgomery 


To Professor Helmut Maier on his 60th birthday 


Abstract Assuming RH, it is shown that the curve ¢(1/2 + it) spirals in the 
clockwise direction for all sufficiently large ¢, in the sense that its curvature is 
negative. 


1 Introduction 


It is well known that arg €(1/2 + if) increases for 0 < t < t, = 6.289836, and 
that it is decreasing for tf > t, apart from jump discontinuities of height mz at 
t if there are precisely m > 0 zeros 6 + iy (counting multiplicity) of ¢(s) with 
y = t. This behavior depends on the Hadamard product and functional equation 
of the zeta function, but is independent of its Euler product, and of the Riemann 
Hypothesis (RH). It may be similarly noted that the curve (1/2 + if) turns in the 
counterclockwise sense (i.e., K(f) > 0) for 0 < t < tf) = 2.7564883864, and it 
seems that the curve turns in the clockwise direction (k(t) < 0) for t > to. We 
confirm this, assuming RH (Fig. 1). 


Theorem 1.1. Assume RH, and let x(t) denote the curvature of the parameterized 
curve €(1/2 + it). There is a ty > 0 such that the curve turns in the clockwise 
direction for all t > ty, which is to say that k(t) < 0 for allt > to. 


It might still be the case that x(t) < 0 for all large t even if RH is false. Whether or 
not depends a great deal on the positions of zeros off the critical line. It is therefore 
instructive to examine the issue for a function that has a functional equation similar 
to that of the zeta function, but has no Euler product. Titchmarsh [4, Sect. 10.25] 
defined a function f(s) to be a linear combination of two quartic L-functions modulo 
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Fig. 1 The curve ¢(1/2 + it) for 0 < t < 40, with €(1/2 + it) and €(1/2 + it,) 


5, and showed that it has zeros with real part > 1, although it satisfies a functional 
equation. As was first discovered by Spira [3], the first zero of f(s) off the critical 
line is at 9o = 0.1914828 + 185.69934849, together with 1—@o, 0, and 1—g. Levinson 
and Montgomery [1] showed that under suitable circumstances zeros f + iy of €(s) 
with 0 < B < 1/2 are in one-to-one correspondence with zeros 6; + iy; of C’(s) 
with 0 < 6; < 1/2. The same reasoning applies to f(s), so f’(s) has exactly one zero 
f, +iy, in the rectangle —1 < B; < 1/2,2 < y; < 100, namely 9; = 0.4053715+ 
i85.70513603. Let «,(t) denote the curvature of the curve f(1/2 + it). Then x(¢) 
is generally negative, but this zero Q, just to the left of the critical line causes x;(f) 
to be positive in the interval [85.559071, 85.849586]. In particular, (v1) = 9.856. 
The curve f(1/2 + it) for t near this height is plotted in Fig. 2. Numerical studies 
reveal that f(s) has exactly 252 zeros with 8 < 1/2 and 0 < y < 5000. Each 
one is accompanied by a nearby zero Q, of f’(s), and k(y1) > 0 for all but 10 
of these y,. The first example of a zero g with 6 < 1/2 for which the curvature 
remains negative occurs when 9 = 0.1022498-+ 12442 .530292 with associated 0; = 
0.2664654 + 12442.535402 for which k¢(y1) = —0.232836. Moreover, k(t) < 0 for 
2425.05 < t < 2452.43. Assuming that the one-to-one association of 9 with @; can 
be continued, let Ny_(T) denote the number of zeros @ of f(s) for which B < 1/2, 
0<y <T, and kf(y1) < 0. Similarly, let Ny+ (7) denote the number of zeros @ of 
f(s) for which B < 1/2,0 < y < T, and k(y1) > 0. Not only do we conjecture 
that both Ny_(T) and Ny+ (T) tend to infinity with T, but also that Ny_(T) >> T and 
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Fig. 2 The curve f(1/2 + it) for 80 < t < 90 


Ny+(T) > T for all large T. Indeed, we find it plausible that Ny_(T)/T — oo and 
Ny+(T)/T — oo as T — oo. However, we offer no speculation as to the relative 
asymptotic sizes of Ny_(T) and N+ (T). 


2 Proof of the Theorem 


We show first that the curvature of the curve ¢(1/2 + if) at height ¢ in the complex 
plane is 


RS(/2+4 it) 


“OF Tea]a+ il 


(1) 
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To this end we write the tangent vector if’(1/2 + it) in polar coordinates, so that 
i¢’(1/2 + it) = Re. Here 


— ; 4+ arg "(1/2 + it) = 5 + Slog t/(1/2 + it). 


By the Cauchy—Riemann equations it follows that 


da mas 
ge 7 p24 io. (2) 


With s denoting arc-length, we have 


d. 
= =R=(f0/2 +0). 3) 
Since 
da 
= da = dt 
K(t) = ag as (4) 
dt 


the relation (1) follows from (2) and (3). 

Levinson and Montgomery [1], assuming RH, showed that ‘i © (s) < 0 for s in 
various regions, and thus gave a new proof of Speiser’s theorem [2] to the effect that 
é’(s) 4 0 for 0 < o < 1/2 if and only if RH is true. In the same way that it is 
natural to express £(s) in terms of a sum over zeros of €(s), one may write a (s) in 
terms of a sum over zeros of ¢’(s). In this way, Yildirim [5] showed, assuming RH, 
that ¢’’(s) 4 0 for 0 <o < 1/2. In particular, Yildirim showed, assuming RH, that 


1 
i ee (5) 
for t > 100. Since it is an easy matter (using, e.g., Maple or Mathematica) to plot 
k(t) for 2 < t < 100, it follows that x(t) < 0 for t > to where fo is defined to be the 
root of ne (1/2 + it) = 0 near 2.75. 
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Best Possible Densities of Dickson m-Tuples, 
as a Consequence of Zhang—Maynard—Tao 


Andrew Granville, Daniel M. Kane, Dimitris Koukoulopoulos, 
and Robert J. Lemke Oliver 


Abstract We determine for what proportion of integers h one now knows that there 
are infinitely many prime pairs p, p + h as a consequence of the Zhang—Maynard— 
Tao theorem. We consider the natural generalization of this to k-tuples of integers, 
and we determine the limit of what can be deduced assuming only the Zhang— 
Maynard—Tao theorem. 


1 Introduction and Statement of Results 


The twin prime conjecture states that there are infinitely many pairs of integers 
(n,n + 2) which are simultaneously primes. More generally, Hardy and Littlewood 
conjectured that each entry of the k-tuple (n + hj,--- ,n + hx) should be prime 
infinitely often, unless there is a trivial reason why this cannot happen. This 
“trivial reason” is about divisibility by small primes p. For example, the triplet 
(n+ 2,n+ 4,n + 6) can never all be prime if n > 1 because at least one of them 
must be a multiple of 3. So we call a k-tuple admissible if, for each prime p, the 
reductions of the numbers /,,..., 4, modulo p do not cover all of Z/pZ. With this 
definition in hand, the Hardy—Littlewood conjecture states that if (4,,..., 4) is an 
admissible k-tuple, then there are infinitely many integers n for which the numbers 
n+h,...,n+ hy are all prime. 

Even though we are still far from proving the full Hardy—Littlewood conjecture, 
there has been remarkable progress made towards it recently. Firstly, in May 2013, 
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Yitang Zhang [6] made headlines by proving that there are bounded gaps between 
primes, and specifically that 


lim inf Pp4-1 — Pn < 70,000,000, 
noo 


where p, denotes the n-th prime. Then, in November 2013, James Maynard [3] 
and Terence Tao independently showed, using somewhat different techniques, that 
70,000,000 can be replaced by 600, and, as it stands right now, the best bound 
known is 246, due to the Polymath project [5]. Even more impressively, they proved 
that for any integer m > | there is an integer k = k,, such that if (4,,..., 4) is an 
admissible k-tuple, then there are infinitely many integers n for which at least m of 
the numbers n + /1,...,2 + hx are prime. Obviously, k,, => m, and in [5] it was 
shown that one can take kp = 50 andk, « e382”, 

We call a k-tuple of integers (/),...,h,) a Dickson k-tuple if there are infinitely 
many integers n for whichn+/1,...,n-+ hy are each prime. The Hardy—Littlewood 
conjecture is equivalent to the statement that “all admissible k-tuples of integers are 
Dickson k-tuples”, and the Maynard—Tao theorem implies that “every admissible 
k-tuple of integers contains a Dickson m-tuple’. In particular, the Maynard—Tao 
theorem implies that Dickson m-tuples exist, yet no explicit example of a Dickson 
m-tuple is known! Nevertheless, a simple counting argument (which also appeared 
in [2]) yields the following attractive consequence of the Maynard—Tao theorem. 


Corollary 1.1. A positive proportion of m-tuples of integers are Dickson m-tuples. 
Proof. Let k = k», so that m < k, and define R = [l<eP and x = NR for some 
(very large) integer N. We let 

MN = {né (—-x,x]: (n,R) = I}, 


so that |.”| = 2x(R)/R. Any subset of k elements of .V is admissible, since it 

does not contain any integer = O (mod p) for any prime p < k. There are (es ') 

such k-tuples. Each contains a Dickson m-tuple by the Maynard—Tao theorem. 
Now suppose that there are T(x) Dickson m-tuples within .”. Any such m-tuple 


is a subset of exactly (ja) of the k-subsets of ./, and hence 


T(x): i > (‘;"). 
k-—m k 
and therefore 


(1) Lem) yy — (BY eogym 
ro) = ( : )/( co) = ert = (22) - 20" 


as desired. oO 


Best Possible Densities of Dickson m-Tuples 135 


The main goal of this paper is to provide better lower bounds on the proportion 
of m-tuples that are Dickson m-tuples. If A(m) is the proportion of such m-tuples, 
then Corollary 1.1 implies that A(m) > 0. We are interested in determining the best 
possible lower bound on A(m) assuming only the results of Zhang, Maynard, and 
Tao and treating them as “black boxes”: If Y,, is the set of Dickson m-tuples, then 
there is an integer k = k,, for which: 


¢ Q,, is translation-invariant, that is to say, if (41,...,/m) € Bn andt € Z, then 
(A, +¢,...,4m +t) © Bn 

* Q, is permutation-invariant, that is to say, if (11,...,Am) € Bn ando € S,», then 
(hea); oe) Nom) € Dn; 

¢ for any admissible k-tuple, (41,...,2x,), there exist distinct ,...,4m € 


{x1,...,X,} such that (14,..., hm) € Bn. 


We call any set A C Z” that has the above properties (m, k)-plausible. Then we 
define 


[AN [-N,N]”"| 


6(m, k) = min § lim inf 
Noo (2N)” 


:A CZ", Ais (m,k) — plausible? ; 
(1) 


and, for any m => 1, we have that 
A(m) = 8(m, km) 


by the Zhang—Maynard-Tao theorem. Our main result is the following. 


Theorem 1.1. Fork > m > 1, we have, uniformly 


O(m) 
6(m, k) = Ae 
(< loglog =)! 

In the Maynard—Tao Theorem we know that one can obtain k,, < e“” for some 
constant c > 0. The best value known for c is a little smaller than 3.82, and Tao 
(http://terrytao.wordpress.com/, Public communication; see also [2]) showed that 
the Maynard—Tao technique cannot be (directly) used to obtain a constant smaller 
than 2. So we deduce that 


A(m) = (Mm, kn) = FOO" > CH, 
and that this can, at most, be improved to >> eo" 
methods. 

A de Polignac number is an integer h for which there are infinitely many pairs of 
primes p and p + h (the twin prime conjecture is the special case h = 2). A positive 
proportion of integers are de Polignac numbers, as follows from both results above, 
but we wish to determine the best explicit lower bounds possible. In Sect. 2, using 
quite elementary arguments, we will show that 


using the currently available 
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1 1 
AQ) = 10.) = 1 — — ) ~ 0.002830695767... 2 
@) = 50.) > = TT ( =| 0002830695767... > (2) 


p<s0 


354" 


and that this cannot be improved dramatically without improving the value of kz = 
50. We remark that this bound is surprisingly good in light of the fact that we can 
only deduce that there is some h < 246 for which there are infinitely many prime 
pairs p,p +h. 

Finally, we note that it would perhaps be more natural to consider the proportion 
A*“(m) of admissible m-tuples that are Dickson m-tuples, rather than the proportion 
of all m-tuples. To this end, in Sect.4, we determine the proportion Qaa(m) of all 
m-tuples that are admissible. Surprisingly, this question has seemingly not been 
addressed in the literature, and we prove that 


eam) 


Qaa(m) = (e’ logmy" (3) 


as mM > ©. 


2 The Density of de Polignac Numbers 


To prove a lower bound on the density of de Polignac numbers, we consider 
admissible sets B of integers that do not contain a Dickson pair. If the prime k- 
tuplets conjecture is true, then necessarily |B| = 1, while Zhang’s theorem implies 
that |B] < kp — 1. Because of this upper bound, there must be maximal such sets, 
in that, for any t ¢ B such that B U {f} is admissible, B U {7} contains a Dickson 
pair. This condition implies that t — B contains a de Polignac number, and we will 
obtain a lower bound on 6(2, k2) by varying t. To this end, for an admissible set B, 
define 7(B) to be the minimal lower density of sets A with the property that tf — B 
contains an element of A whenever B U {ft} is admissible. Moreover, for any integer 
£, set n(€) to be the infimum of 7(B) as B runs over admissible sets of size £. We 
thus have that 


A(2) = 6(2,k2) = min n(€), 
1<£<ko—-1 


and we will prove the following. 


Proposition 2.1. For any integer ¢, we have that n(t) ~ e~”/£ log £, and explicitly 
that 


psl+l1 


for any positive integer y < €—1. 
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For the particular application to A(2), we take £ € {1,..., 49} in Proposition 2.1 
to find that 


1 
8(2,ke) = | min. (6) > 0.002830695767 > 57. 


while taking y = 13 yields that 7(49) < 0.005328005328 < = so that, using our 
techniques, we can hope for an improvement in our lower bound for 6(2, kz) by a 
factor of at most about two. However, we can do better: the explicit upper bound 


given in Lemma 3.1 below shows that 6(2, 50) < ort < 0.0048. 


Proof of Proposition 2.1. We begin with the lower bound for n(£). Suppose that B 
is admissible of size £, so that for each prime p < £+ 1 there exists a residue class np 
(mod p) such that p + n, + b for each b € B. If t # —n, (mod p) forallp < £+ 1, 
then B U {t} is admissible and so contains a Dickson pair, and, as remarked above, 
t must be one of that pair, else B would have contained a Dickson pair. If x is large, 
then the number of such integers t < x is 


and we know that, for some b € B, t — b is a de Polignac number. Given t € Z, an 
integer can be written in at most £ ways as t — b for b € B, since |B| = £. Hence, 


1 1 e 
n(B) > e I] (:- -) ~ Llog el 


psl+l1 P 


as £ + oo. 

We now turn to the upper bound. We will construct sets A and B with the 
properties that B is admissible and that if ¢ is such that B U {t} is admissible, then 
there exists b = b, € B such that t — b, € A. 

Let y € {1,...,€ — 1} and set v = ¢ — y. Moreover, define r = [he? and 
n= II,<e p, and select h large so that q := hv + 1 is a prime > £. We define A to 
be the set of all integers a for which (a + 1,r) = 1 anda = 0,1,2,..., orh—1 
(mod q). The density 6(A) of A is 


a5) s=5110-3) 


psy 


Our set B will be constructed as the union of two sets, B, and Bo. For B,, we take 
B, := {bo,b1,...,by} where each b; is chosen to satisfy b; = ih (mod q) and 
b; = 1 (mod m), and we choose B; to be a set of y— 1 integers covering the residue 
classes 2,...,p—1 (mod p) for each prime p < y. We note that B := B, U By has ¢ 
elements and is admissible: it occupies the congruence classes 1,...,—1 (mod p) 
for all p < y and covers at most y classes modulo p for p > y. 
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If tis such that BU {t} is admissible, then t € 0 (mod p) for all primes p < y. We 
can write t = ih+j (mod q) for some i and / satisfying 0 < i < vand0 <j <h-1. 
Consider t—b;. This is = j (mod qg) and ¥ —1 (mod p) for all primes p < y. Hence 
t — b; € A, so that n(£) > 6(A), and the result follows. Oo 


Remark 2.1, Pintz [4] showed that there is an ineffective constant C such that every 
interval of length C contains a de Polignac number. The proof of the lower bound 
above furnishes a different proof of this result: the values of ¢ that appear are periodic 
modulo r := TL <ky D> SO, given a maximal set B, we obtain Pintz’s result with 
C=rt+maxpeg b — mingesg b. 


3 Proof of Theorem 1.1 


We begin with a strong version of the upper bound of Theorem 1.1. 


Lemma 3.1. [fk > m > 2 with k/m — ov, then 


m—1 
e’+o(1) 
d(m,k) < (=| 
og log 


m—1 m—1 


Proof. Pick q as large as possible so that (m — 1)¢(q) < k. We claim that 
A = {(X1,..-,Xm) 1X1 =X. =... =Xm Mod gq} 


is (m, k)-plausible. This is because any admissible set with k elements may take at 
most (q) distinct values modulo g, so by the pigeonhole principle, at least m of 
these values must be congruent modulo q, giving an m-tuple contained in A. On the 
other hand, q is of size > (e” + o(1)) log log a as k/m — oo, and A has 
density 1/q”~!. This completes the proof. Oo 


The proof of the lower bound is somewhat more involved. A key ingredient is 
the Lovasz Local Lemma: 


Lovasz Local Lemma. Suppose that E,...,E, are events, each of which occurs 
with probability < p and depends on no more than d of the others. If d < 1/(4p), 
then the probability that no E; occurs is at least ee, 


The input for the Lovasz Local Lemma will come from the following technical 
result. 


Lemma 3.2. Let k > m > 1. Consider a translation and permutation invariant set 
AC ((—2x, 2x] NZ)”, and set V = {n€ (—x,x] NZ: (1, [],<,P) = 1. Uf 


anv 1 
|v™| 8m(_}) 


m—1 
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and x is large enough in terms of m and k, then 


(log 3m)0™ 


A| > —————__ - 
4] 2 (4 log log A yn—1 
Before we prove Lemma 3.2, we use it to deduce the lower bound in Theorem 1.1. 


Proof of the lower bound in Theorem 1.1. Fix a large positive x and suppose that 
A C [-2x, 2x]” is an (m, k)-plausible set, yet 


; k k m—1 
|A| < (log 2m)~°"x” / (- log log ~) 
m m 


for some c > 0. If c is large enough, then Lemma 3.2 implies that 


anwvm| 
vn 8m(*!)’ 


m—1 


(4) 


where VY := {n € (—x,x] NZ: (n, [p< P) = 1}. Now select integers 7),..., m 
uniformly at random from ./%. We claim that there is a positive probability that 
the n; are distinct and that there is no m-element subset of {n;,...,m,} in A, which 
implies that A is not (m, k)-plausible. We use the Lovasz Local Lemma, in which 
we wish to avoid the events nj = nj for 1 <i <j < kand (%,...,x%m) € A for any 
choice of m elements x, ..., Xn among the integers m),...,,. There are (5) + (7) 
such events. Each event depends on no more than m of the n;, and so depends on 
no more than m(*;') + m(*1) < 2m(*1) other events. If x;,...,Xm are m out of 
the k random variables, then the probability that (41,...,%m) € Ais < 1/ (8m(‘—1)) 
by (4). We finally note that the probability that n; = n; is 1/|.Y| ~ R/(2xd(R)) as 
x — 00, which is certainly < 1/ (8m(‘~1)) for x sufficiently large. 

The Lovész Local Lemma now implies that there exist distinct elements 


n,...,Mm% € MW for which (x,...,%m) ¢ A for all subsets {x1,...,x%,} of 


{n1,...,m}. (in fact, the Lovasz Local Lemma implies that this is true for a 
proportion > e~*/ 2m” of the k-subsets of VW .) Hence, A is not an (m, k)-plausible 
set, a contradiction. oO 


To prove Lemma 3.2, we first need another result. Given an m-tuple h = 
(ty,..., hm), we denote by n,(h) the number of congruence classes mod p covered 
by /1,...,/4m. Note that 1 < n,(h) < min{p,m} and that both upper and lower 
bounds are easily obtained for some m-tuple h. 


Lemma 3.3. Let k > m > 1. There is an absolute constant c > 0 such that ifh is a 
randomly selected m-tuple from NV = {n € (—x,x] NZ: (n, [lee p) = ly and x is 
large enough in terms of m and k, then the probability that 


psk P 
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eae 
~ 16m(*~1) 


m—1 


Proof. The result is trivial if k < m’, so we will assume that k > m7’. Similarly, we 
may assume that k is large enough. Moreover, note that there are <<; |.V|""! = 
Ox>00(|V |") m-tuples (4,...,/4m) with non-distinct elements. So it suffices to 
show that if we randomly select a subset B = {bi,..., Dm} of distinct elements of 
WN, then the probability that 


I] (1 = “2 (1 = ~\" > (log 3m)" (log log ky"! 


psk P 


7 _ : 
17m(,,-1) 
By Mertens’ Theorem, the contribution of the small primes is 
B 1 —m 1 —m 
Il (1 = “ee (1 a -) <T] (: a *) < (log 3m)" 
p<m? P P p<m? P 
for some absolute constant c; > 0. If c > 2c; and we set 


re= J] (1 = ne) (1 = ~) 


m2 <p<k P P 


then we are left to show that 


1 
P(F(B) > (log 3m)? (log log ee) < Tim) 


Taking logarithms, we have 


m—n m m—n 1 
EON arent) heraeadC) 


m2 <p<k P 


sin SH 40(L). 


m2 <p<k P 
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where X,(B) = 1 if n,(B) < m, and X,(B) = 0 if n,(B) = m. Therefore, if we set 
X(B) = Yon? <p<k Xp(B)/p and we assume that c is large enough, then it suffices to 
show that 


1 
P (x@) > 5 log log(3m) + log log log k) < Tm(=) 


m—|1 


In turn, the above inequality is a consequence of the weaker estimate 


P (X(B) > log log(max{m’, m log k}) + c’) < a, (5) 
17m( ) 


m—| 


where c’ is a sufficiently large constant. 
The X, can be viewed as independent random variables as we run over all 
possible sets B. As in the birthday paradox, the probability that X, = 0 is 


Gi a aa] age 
1 — —— }{ 1- —— ]...{ 1-— —— } = exp {| -————— 
p-1l p-1 p-\1 2p 
we 
=1+0(7). 
P 


For any r, if p < r, then we trivially have that E[e?/?] < e’/?, and otherwise 


E[e"*/?] = P(X, = 0) + P(X, = Ie”? = 14 P(X, = le” — 1) 


<oo(0()) 


Therefore, for any values of s and r, we have that 


e“P(X>s)< z[e"*] = I] B[e"Xr/P) = Tle” Tle” 


psk psr p2r 


< exp (rloglogr + O(r) + O(m’/ log r)). 
Thus, if r > m7’, then setting s = loglogr + c’ for c’ sufficiently large, we find that 
P(X > loglogr+c') <e”. 


Substituting r = max{m?, m log k} establishes (5) for k large enough, thus complet- 
ing the proof of the lemma. Oo 


Proof of Lemma 3.2. We partition the elements of [—2x,2x]” into translation 
classes, putting two elements in the same class if and only if they differ by 
(€,£,...,£) for some integer €. Each translation class T intersecting [—x, x]” 
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contains at least 2x elements of [—2x, 2x]” (and at most 6x). The main idea of the 
proof is that if we can find at least U elements of AN .4”” whose translation class 
has at most M elements inside .”, then |A| > 2x-U/M. 

Note that n,(h) is fixed over all h € T, so we denote it by n,(T). The number 
of integers £ for which h + (£,£,...,£) € Wis < 3x] [<k (1 - “2 for x 


large enough. If we set R = II, <; P, then Lemma 3.3 implies that the proportion of 
elements of .V” for which 


I (: _ “ee < (“ey (log 3m)°" (log log ky"! (6) 


pk P 


is< 1 / 16m Si) provided that x is large enough, with c being an absolute constant. 
Since A contains at least |.”’"| jf 8m Ce elements in .’” by assumption, we 


find that A contains at least |.”""| N 16m(‘1) for which (6) holds. Therefore, we 
conclude that 


Iv o(R)\" = 
[Al ee —] (log 3m) (log log k)” 
24m(",_1) R 
x" (log 3m)” 
(£ log log ae 


for x large enough, and the result follows. Oo 


4 The Number Of Admissible k-Tuples 


Our goal in this section is to show relation (3). Given a prime p, we say that an 
m-tuple is admissible mod p if its elements do not occupy all of the residue classes 
mod p, so an m-tuple is admissible if and only if it is admissible mod p for every 
prime p. By the pigeonhole principle, any set of m integers is admissible mod p if 
Pp > m, so to test for admissibility we need only work with the primes p < m. This 
implies, using the Chinese Remainder Theorem, that 


daa(m) = | | exa(m,p). 


psxm 


where Qaa(m, p) denotes the proportion of m-tuples that are admissible mod p. 

If m/logm < p < m, then we note the trivial bounds (1—1/p)” < Qaa(m, p) < 1, 
with the lower bound coming from counting m-tuples whose elements are not 0 
mod p. Therefore 


Ly" mlog log m 
1> []  eatmp)> I] (1-5) gO 


m/logm<p<m m/logm<p<m 
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It remains to compute the contribution of primes p < m/logm. It is not difficult to 
determine an exact expression for Qaq(m, p) using an inclusion—exclusion argument: 
the probability that the elements of an m-tuple h belong to a given subset of p — 1 
residue classes is (1 — ie There are (') choices of the p — | residue classes. If the 
elements of h belong to exactly p—2 residue classes mod p, then h was just counted 
twice and so we need to subtract the probability of this happening. That probability 
is (1 — a and there are (5) choices of the p — 2 residue classes. Continuing in the 
way, we find that 


2S 


j=l 


PQ 


We note that ratio of two consecutive summands in absolute value is 


ne er I ye sya 
( 0-4)" | papal) pap pa2). p= 


> 2exp | — ~ lop - | > 2logm 
p= 


Qaa(m, p) 


lI 
1 
wD 
ee 
a 
pa 
=. 
——~ 
— 
| 
|~ 
ee 
~ 


for all p < m/ log m. Therefore, we deduce that 


cainn)=n(1-2)"(140(242)) 
Dp logm 


which implies that 


Ol m log logm 


m 
OC m og log m ) 1 e logm 
m) = e7> bem | 1— = ‘ 
Oaal ) P (ev log yn 


p<m/logm 4 


which proves (a quantitative version of) relation (3). 


5 A Better Density in the Continuous Case 


Analogous to the discrete question considered here, one can also ask about the 
continuous version, i.e. the set of limit points & of the set of values of (Dn+1 — 
Pn)/ log Pn, Where p, is the nth prime. One can deduce from a uniform version of 
Zhang’s Theorem that for any 0 < 6B; < Bo <... < By with k = ky there exists 
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1 <i<j<ksuch that B; — B; € ; in fact, this was done by Banks, Freiberg, and 
Maynard [1]. By a small modification of the argument used to prove Corollary 1.1, 
one can then show that & NM [0, T] has Lebesgue measure = T/(k — 1). Somewhat 
remarkably, Banks, Freiberg, and Maynard were able to go beyond this by showing 
that, given a sufficiently large k-tuple partitioned into 9 equal parts, at least two 
of these parts must simultaneously represent a prime. From this, they deduce that 
£1 [0, T] has Lebesgue measure 2 T/8. 
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A Note on Helson’s Conjecture on Moments 
of Random Multiplicative Functions 


Adam J. Harper, Ashkan Nikeghbali, and Maksym Radziwill 


To Professor Helmut Maier on the occasion of his 60th birthday 


1 Introduction 


In this note we are interested in cancellations in sums of multiplicative functions. 
It is well known that 


MG) = > ea) = 0G") 


nsx 


is equivalent to the Riemann Hypothesis. On the other hand, it is also a classical 
result that M(x) > x!/?-® for a sequence of arbitrarily large x. It is in fact conjectured 
that 


M(x) 


5 
./x(log log log x) 4 


lim = +B 
X00 


for some constant B > 0 (see [21]). 

Wintner [24] initiated the study of what happens for a generic multiplicative 
function which is as likely to be 1 or —1 on the primes. Consider f(p), a sequence 
of independent random variables taking values +1 with probability 1/2 each 
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(i.e., Rademacher random variables), and define a multiplicative function supported 
on squarefree integers n by 


fa) =| [ FO). 


pin 


We shall refer to such a function as a Rademacher random multiplicative function. 
By the three series theorem, the Euler product F(s) := 11,0 + f(p)p *) converges 


almost surely for his > 5. From this Wintner deduced that 


YF (n) < x'/?+* almost surely (a.s.) 


n<x 


Since then the problem of the behavior of }°,,-,f(n) has attracted considerable 
attention [7, 11-13, 17, 18]. A closely related model is to let f(p) be uniformly 
distributed on the complex unit circle (i.e., Steinhaus random variables), and then 
define f(n) := TT ,I10 f (p)* for all n. We shall refer to such a function as a Steinhaus 
random multiplicative function. 

Very recently mean values of random multiplicative functions arose in connec- 
tion with harmonic analysis. In his last paper Helson [16] considered the question 
of generalizing Nehari’s theorem to the infinite polydisk. He noticed that the 
generalization could be disproved if one could show that 


i 77 
lim = J | Yon" |dt = o( VN). 1 
Hise f [pete ool © 
Using Bohr’s identification, we have 
2q 1/2q 2q 1/2q 


dt (2) 


YF (n) 


n<N 


n<N 


for all 2g > 0, and with f(m) a Steinhaus random multiplicative function. 
Therefore (1) is equivalent to 


= o(VN), (3) 


Yif@) 


n<N 


vi 


with f(m) a Steinhaus random multiplicative function. Helson justified his belief 
in (1) by observing that N(it) := >°,-yn" is the multiplicative analogue of 
the classical Dirichlet kernel D(@) := Yin) <y ern Since ||D\|) = o(||DI|2) 
Helson conjectured that the same phenomenon should happen for the “multiplicative 
analogue” N(it). Another reason one might believe the large cancellation in (1) to 
be possible is that on the +-line one has 
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i a 1 
oo T , | » ni/2+it 
n<N 


as follows from the work of Bondarenko et al. [3]. This bound is stronger than one 
would expect assuming only squareroot cancellation, which would suggest a size 
more like log!/? N. 

Recently Ortega-Cerda and Seip [22] proved that Nehari’s theorem doesn’t 
extend to the infinite polydisk. However the problem of establishing (1) remained. 
There are now reasons to believe that (1) is false. In a recent paper Bondarenko and 
Seip [2] showed that the first absolute moment is at least WN (log N)~*+°) for some 
small 6 < 1. Our primary goal in this note is to improve further on the lower bounds 
for (2). Our results also work for Rademacher random multiplicative functions. 


dt < log!/4*+°) ny, 


Theorem 1.1. Let f(n) be a Rademacher or Steinhaus random multiplicative 
function. Then, 


x 
4 


> VN(loglog Nyt) 


iD 


n<N 


as N > o. 


The main input in the proof of Theorem 1.1 is the work [12] of the first named 
author on lower bounds for sums of random multiplicative functions. Using Hélder’s 
inequality, we can extend the result of Theorem 1.1 to L’ norms. 


Theorem 1.2. Let f(n) be a Rademacher or Steinhaus random multiplicative 
function and let 0 < q < 1. Then, 


2q 
> N4i(loglog N) et), 


TG 
vi 


Yif@) 


n<N 


Theorems 1.1 and 1.2 suggest it is rather unlikely that Helson’s conjecture is true. 
See Conjecture 1.1, below. 

In addition to the above results, we establish an asymptotic estimate for the 2k-th 
moment when k is a positive integer. 


Theorem 1.3. Let k € N. Suppose that f(n) is a Steinhaus random multiplicative 
function. Then, as N — od, 


Yif@) 


n<N 


x 
4 


2k 
2k —2 . 
- a fee -ceyeN* log Ny®", 


where y; is the volume of Birkhoff polytope &y;, defined as the (k — 1)* dimensional 
volume of the set of (ujj) € Re such that 
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for eachi<k: ss ujj = 1 


I<j<k 


and for eachj <k: > uj = 1, 


1<i<k 
and 
1\z2 eae 
o«=[T[(--) -(+ m1) 
: I] P 2 pe 


Note that A; is a (k — 1)* dimensional object embedded in a k* dimensional space. 
The (k — 1)? dimensional volume of A, is equal (see, e.g., Sect. 2 of Chan and 
Robbins [6]) to k*~! times the full-dimensional volume of the set of (ui j)ij<k—1 € 
R¢-)” such that, for all ij<k-1, 


y > uij < 1 and Y> ij < 1 and > ujj > k— 2. 


jsk-l i<k-1 ij<k-l 


The latter is how the volume of A; will actually arise in our calculations. 

It is worth pointing out that finding a closed formula for the volume of the 
Birkhoff polytope A, is a notorious open question and would be of interest in 
enumerative combinatorics, statistics, and computational geometry (see [23]). There 
are evaluations of Vol(4,) for small values of k (see [1, 6]), 


ae 352-43 4718075 - 54 
5m + VOU Ba) = —35,— » Wol(Bs) = ——— 


Vol(43) = 


pote 


and an asymptotic formula is known to hold [5] 


ko IP ok? 


/ 1/3 
Vol(Ax) i, 27e . ~ Qnyk rf 


k> oOo. 


In addition the asymptotic behavior of the Euler product cx, is known (see [9, 
Proposition]), 


log c, = —k* log(2e” log k) + o(k’) 


where y is the Euler—Mascheroni constant. 
We note that Conrey and Gamburd [8] compute the even integer moments 


2k 
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on the 5-line, and unsurprisingly the answer is extremely similar to Theorem 1.3 


(in particular an Euler product and a volume related to the Birkhoff polytope again 
appear). Conrey and Gamburd discuss the connection between their result and the 
moments of certain truncated characteristic polynomials of random matrices. In 
general, it seems reasonable to say that the arithmetic factor c, reflects local count- 
ing modulo different primes in the moment computation, whereas the geometric 
factor y;, reflects global counting of tuples m,,...,n, and my ,...,m, subject to the 
truncation n;,m; < N. 

We deduce Theorem 1.3 from a general result of La Bretéche [4] on mean values 
of multiplicative functions in several variables. Theorem 1.3 has also been obtained 
independently by Granville and Soundararajan (unpublished), and also very recently 
(and independently) by Heap and Lindqvist [15]. Additionally, Theorem 1.3 sheds 
light on the conjectural behavior of moments of the theta functions, 


9.x) = Do xier™? 


n>1 


with p > 3 a prime, and y an even Dirichlet character modulo p. The rapidly 
decaying factor em x!p essentially restricts the sum to those n less than about ./p 
(if x = 1, say), and the average behavior of y(n) with n < p!/? is similar to that 
of a Steinhaus random multiplicative function. Therefore Theorem 1.3 leads to the 
conjecture that 


> 100, 0P*~ Ciph*Cogp)" as p> 00. 
Aer 
In unpublished recent work the same conjecture was stated by Marc Munsch on the 
basis of his lower bound for moments of @(1, 7). Louboutin and Munsch [19] prove 
the conjecture for k = | andk = 2. 
Combining Theorems 1.2 and 1.3 suggests the following “counter-conjecture” to 
Helson’s claim (1). 


Conjecture 1.1. If f(n) is a Steinhaus random multiplicative function, then we have 


asN > oc, 


2 
: C(q)N4, forO<q<1l 


C(g)N(logN)'-)", for 1 <4. 


Yif@) 


n<N 


“4 


Conjecture 1.1 suggests a possible line of attack on the problem of showing that for 
a positive proportion of even characters y modulo p, we have 0(1, ¥) # 0. This 
would be based on comparing the first and second absolute moments, i.e. 


d> 180.01 and D2 00, *P. 


x mod p xX mod p 
x even x even 
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We emphasize that we do not have a lot of evidence towards Conjecture 1.1 
when q ¢ N, and perhaps especially when 0 < q <_ 1, and it is conceivable 
the behavior could be more complicated. However this is the simplest possible 
conjecture respecting the information that we now have. In addition for g > 1 it 
perhaps seems unlikely that the distribution of the tails of )°,f(n) (in a large 
deviation regime) fluctuates so significantly that it would affect the exponent (q—1)? 
of the logarithm when g goes from an integer to a fractional exponent. We also note 
that if we could obtain the order of magnitude for the 2g-th moment suggested by 
Conjecture 1.1 for g = 5, then since we know it trivially for g = 1 a simple 
argument using Holder’s inequality (as in the proof of Theorem 1.2, below) would 
establish the order of magnitude suggested by the Conjecture for all 0 < q < 1. 

Finally, following a question from the referee, we noticed that we can extend 
Theorem 1.3 to the Rademacher case. We omit the simple cases of k = 1,2 in the 
theorem below, since both are different from the case k > 3. 


Theorem 1.4. Let f(n) be a Rademacher random multiplicative function. Then, for 
k => 3 an integer, as N > ~, 


a( Yr), ~ Cy N*/2 (log NyKE3)/2 


n<N 


with C;, > 0 constant. 


Similarly as in Theorem 1.3 the constant C; splits into an arithmetic and geometric 
factor. The interested reader should have no trouble working out the details. 
Theorem 1.4 has also been obtained independently by Heap and Lindqvist [15]. 

At first glance it may seem strange that all the moments here (including the odd 
ones) are non-trivially large, but that is because in the Rademacher case there is no 
distinction between a term and its complex conjugate (and similarly if one calculated 
an expression like E pa <ni (n)|"* ( <nt (n)) in the Steinhaus case, this would 
be non-trivially large provided k > 1). Note also that the moments are rather larger 
in the Rademacher case than the Steinhaus case, again because everything is real 
valued and so the terms exhibit less cancellation. 


2 Lower Bounds for the First Moment 


In this section we shall first prove the following result. 


Proposition 2.1. Let f(n) be a Rademacher random multiplicative function. There 
exist arbitrarily large values of x for which 


a) vA 


v 


a 
~ (log log x)3 +e) 


nsx 


The same is true if f(n) is a Steinhaus random multiplicative function. 
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The above proposition is actually a fairly straightforward deduction from the 
work of Harper [12]. However, it is a bit unsatisfactory because it only gives a lower 
bound along some special sequence of x values. With more work we can correct this 
defect, as in the following theorem announced in the Introduction: 


Theorem 2.1. Let f(n) be a Rademacher random multiplicative function. Then for 
all large x we have 


~ dog log x)3+e() ° 


nsx 


b> Ff (n FE gle 


The same is true if f(n) is a Steinhaus random multiplicative function. 


The proof of Proposition 2.1 has two ingredients. The first is the observation, 
essentially due to Haldsz [11], that one can almost surely lower bound an average 
of pas eyh (n)| in terms of the behavior of f(7) on primes only: more specifically, in 
the Rademacher case we almost surely have that, for any y > 2, 


i. [acer | lamsd VY sy sup ap exe (Oras log t — log log(t + 2)/2). 
1 


z3/2+1/logy 1/2+1/logy 


Here the implicit constant in the > notation is absolute. The reader should note that 
the presence of the supremum over f will be very significant here, since at any fixed 
t the expected size of the right-hand side would be too small to produce a useful 
result (about log!/ + y, rather than about log y which is what we need). 

The second ingredient is a strong lower bound for the expected size of the right- 
hand side, which we deduce from the work of Harper [12]. We quote the relevant 
statements from Harper’s work as a lemma now, for ease of reference later. 


Lemma 2.1 (See § 6.3 of [12]). Jf (f(P))p prime are independent Rademacher 
random variables, then with probability 1 — o(1) as x > oo we have 


fp) cos(tlogp) 
Pea 


sup pi/24+1/logx 


1<1<2(log log x)? 


> loglogx — log log log x 
Pp 


—O((log log log x)*/*). 


Tf (f (P))p prime are independent Steinhaus random variables, then with probability 
1 — 0(1) as x > cw we have 


ig CO, itor 


p}/24+1/logx 2 p)+2/logx ) = log log x _ log log log x 


1<r<2(log log x)? Pp 


—O((log log log x)*/*). 
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The first statement here is proved in the last paragraph in § 6.3 of [12] (noting 
that the quantity y there is log’ x). The second statement can be proved by 
straightforward adaptation of that argument, the point being that the expectation 
and covariance structure of these random sums in the Steinhaus case are the same, 
up to negligible error terms, as in the Rademacher case, so the same arguments 
can be applied. (See the preprint [14] for an explicit treatment of some very 
similar Steinhaus random sums.) The argument in [12] is quite involved, but the 
basic aim is to show that, for the purpose of taking the supremum, the sums 


SUP] </<2(log log x)? me eee behave somewhat independently at values of f that 


are separated by >> 1/log.x, so one has something like the supremum over log x 
independent samples. 

To prove Theorem 1.1 we introduce a third ingredient, namely we show that 
Ap ae f (n)| may itself be lower bounded in terms of an integral average of 


q Pet 0) , as follows: 


Proposition 2.2. Let f(n) be a Rademacher random multiplicative function. For 
any large x we have 


YF (n) 


nsx 


vi 


Ss af oi | net) dz 
logx Jy JZ z 


The same is true if f(n) is a Steinhaus random multiplicative function. 


This uses the multiplicativity of f(7) in an essential way (as does the proof of 
Proposition 2.1, of course). 

Theorem 1.1 then follows quickly by combining Proposition 2.2 with the proof 
of Proposition 2.1. 

As the reader will see, the proof of Proposition 2.2 is based on a “physical space” 
decomposition of the sum )°,_.,.f(n), which is somewhat related to the martingale 
arguments of Harper [13]. This is unlike the other arguments above, which work 
by establishing a connection between the integral average of >>, f(n) and its 
Dirichlet series }*, f(n)/n* (on the “Fourier space” side). 


NX 


2.1 Proof of Proposition 2.1 


The proof of Proposition 2.1 is slightly cleaner in the Rademacher case, because 
then f(p)* = 1 for all primes p. So we shall give the proof in that case first, and 
afterwards explain the small changes that arise in the Steinhaus case. 

We know from work of Wintner [24] that almost surely )>—,. f(n) = O¢(x!/?**). 
Consequently, by partial summation the Dirichlet series F(s) := >, f(n)/n* is 
almost surely convergent in the half plane t(s) > 1/2, and then by term by term 
integration it satisfies 
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F(s) = sf Lseh() 5 H(s) > 1/2. 


zt 


In particular, F(s) is almost surely a holomorphic function on the half plane t(s) > 
1/2. 

On the other hand, since f(n) is multiplicative we have for any ‘X(s) > 1 that, in 
the Rademacher case, 


F(s) = T1(1+ 22) =ex( Sto (1+) ) 
P Pp 
fp) _ Foy" (es = f(p)' 
=o DO eh) 
p 


k>3 


Therefore in the Rademacher case we have 


Pensel 4, f(p) _ floy" (-1)**! f(r) 
fe SL ae a 
P 


oe 2 


at least when 3i(s) > 1, since both sides are equal to F(s). But all the sums involving 
p* and p* are clearly absolutely convergent whenever Si(s) > 1/2, and therefore 
define holomorphic functions there. In addition, for any fixed s with H(s) > 1/2 
the series ae f o is a sum of independent random variables, and Kolmogorov’s 
Three Series Theorem implies it converges almost surely. Since a Dirichlet series is 
a holomorphic function strictly to the right of its abscissa of converge, we find that 
almost surely }> z up is a holomorphic function on the half plane 9i(s) > 1/2, and 


so almost surely we have, for all Sis > ae 


pa Ay ae : 1B ae Ny 
sf a o(L ae +O ) a ). 


k>3 


Next, if we write s = o + it and take absolute values on both sides then we find 
that, almost surely, 


~7o+T 


= 9( ORs ae re ye) 


k>3 


i i XnecfOI 


RK =i RK 2.,-2it 
= exp (> (F)p ey Fey") 


= + o(1)), Vo > 1/2. 


P 
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If we take o = 1/2 + 1/ logy for a parameter y > 2, and we note that then |s| = |¢| 
provided t > | (say), we have almost surely that for all y > 2, 


tr > neh D)| K(f (p)p~*) i RF (p)2p~2") aie: i) 
1 


23/2+1/logy dz > sup exp ( » p!/2+1/logy 2 p'+2/logy 
_ P P 


In the Rademacher case the first sum over p is = Saran and (since f(p)* = 


1) the second sum over p is } > pee = Mlogé(1 + 2/logy + 2it) + 
O(1), where € denotes the Riemann zeta function. Standard estimates (see, e.g., 
Theorem 6.7 of Montgomery and Vaughan [20]) imply that | log €(1 + 2/logy + 
2it)| < log log(t + 2) + O(1) for t => 1, so we have almost surely that for all y > 2, 


os el CD) f(p) cos(t log p) 
/ 73/2+1/ logy dz > et exp ( a ~ pl/2#i/losy = log t = log log(t + 2)/2). 
= Pp 


(4) 


(The above argument and inequality (4) are essentially due to Halasz [11], and 
are also related to the arguments of Wintner [24]. The only small difference is that 
Halasz restricted to 1 < t < 2. See Appendix A of Harper [12] for a presentation 
similar to the above.) 

Now to prove Proposition 2.1, note that for any large parameters x and xp < x, 
we have 


a | net | ss 1 [ y [Dencel (| 


sup > Z, 
eG Jz logx Jy 3/2+1/losx 
* al dz 00 dz — 
since J. atijm: S i 1 yes = logx. Then by Cauchy—Schwarz we always 


have E| al | < ./z, so 


[ nse) 5, za ia Xnsef(0)| dz 
P z /2+1/logx 1 73/2+1/logx 


i dz a dz 
1 zi+1/logx af zi+1/logx 


i | en<cf(n)| 
1 


23/2+1/logx 


log x 


= 1/1 : 
Ps ogx 


dz — log x9 — 
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In particular, if we choose xy = ev losx and x, = efl08*) loglogx say, then we have 


A _f(n ] O° 4s 2 
sup inecF (I > i Se dz— . (5) 
ipezen JZ logx 23/2+1/ log x log x 


Finally, in the Rademacher case Lemma 2.1 implies that, with probability 1—o(1) 
as X > 00, 


tl 
sup papi ea) > log log x — log log log x 


1<t<2(log log x)? “> piers 
— O((log log log x)*/*). 


This implies that with probability 1 — o(1) one has 


f(p) cos(tlog p) log x 
ET — logt — log log(t + 2 2) > a 
= i ( dX pi/2+1/logx 08 08 og( )/ ~ (log log x)3 +e)’ 


and then by the Halasz type lower bound inequality (4) we deduce 


iM ) Lnccf(m)| = logx (6) 
1 


73/2+1/ log x a (log log x)3 +e) : 


Proposition 2.1 follows in the Rademacher case by combining this with (5). 

In the Steinhaus case the initial argument of Wintner [24] still works, so the first 
change that is needed in the preceding argument comes in the expression for the 
Euler product F(s), which for i(s) > 1 is now 


Il De = exp(- Dve(1 -f2)) 


P 
fp) FO? , > 1 yf 
OD ss Die: oa e> me) 
Pp 


Notice this is the same as we had in the Rademacher case, except now there are 
no alternating minus signs in the final exponential. The argument using the Three 
Series Theorem, etc. then continues as in the Rademacher case to yield that, almost 
surely, 


sf Rens ge = x(t Ee we Se -) 


k>3 Pp 


F(s) 


II 


II 


V R(s) > 1/2. 
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Putting s = 1/2+1/logy-+ it and taking absolute values on both sides, we deduce 
that almost surely, 


~ opt) Rf (p)p~") ] Rf (p)-p—2") 
i; is Tiay & > sup exp (DC piztiiogy + 2 p27 085 ) — logs) 
> - 


Vy>2. (7) 


Since we don’t now have f(p)? = 1, we cannot remove the contribution of the 
prime squares using estimates for the zeta function. However, by the Steinhaus case 
of Lemma 2.1 we still have that, with probability 1 — o(1) as x > oo, 


REO), URFOYP2") 


sup ( + 
dticeined? = pi/2+1/logx 2 pi*2/ logx 


) > log log x — log log log x 


— O((log log log x)?/*), 
and therefore with probability 1 — o(1) we have 
nae ‘63 (ae (p)p-") | IRE ae) tog) > log x 
S P — plat tiogx "2 pi¥2/ioex si} (log log xo)" 


Combining this estimate with (7) and (5) then proves Proposition 2.1 in the 
Steinhaus case. 


2.2 Proofs of Theorem 1.1 and Proposition 2.2 


Proof (Proof of Theorem 1.1, Assuming Proposition 2.2). In view of Proposition 2.2, 
it will suffice to prove that for all large x we have 


[( | %rse/ (1) dz _logx 
1 


Jz z ~ (loglog.x)3+0) ” 


However, for any large parameter y we have 


dz 


j* Cea) 2. ee 
1 1 


ase 2S 73/2+1/ logy 


log y [ ‘| LnsefOl | 
Zs 
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se es E| Dy<: 
in view of the lower bound /, = Lencet lag > 


= aosioe say Obtained in (6). By 


Cauchy—Schwarz we always have E| >°,,<.f(n)| < ./Z, so the subtracted term here 
is at most 


dz _—ilogy 
We zitl/logy ~~ (/x)1/ losy’ 


If we choose log y somewhat smaller than logx, say logy = (log x)/(100 log log 
log x), we deduce that 


i o| en<et(™)|\ dz a log x loge log x 
1 Jz z  (loglogx)3+°) ~— (loglogx)5° (log log x)3 +)’ 


as required. 


Proof (Proof of Proposition 2.2). The first part of the proof again differs slightly 
depending on whether we are in the Rademacher or the Steinhaus case. We will first 
work in the Rademacher case and then explain the small changes needed in the other 
situation. 

Let A; := >¢,-,f(n). If we let P(n) denote the largest prime factor of n, we have 


YA@=>)> DY FO=dY FO YD) fer, 


n<x PSX n<x,P(n)=p psx m<x/p,P(m)<p 


since f is multiplicative. Here the inequality P(m) < p in the final sum is strict 
because f is supported on squarefree numbers. Notice here that if p > ./x then 
x/p < ./x < p, SO we automatically have P(m) < p in the inner sums over m. Thus 
we can rewrite things slightly as 


YiM= YL fe YM fm+>¥ fo Yo fH 


nsx J/X<psx m<x/p psJ/x m<x/p,P(m)<p 
a f(P)Ax/p al By, 
/X<pSx 


say. Notice also that the random variables A,/, and B, are independent of the f(p) 


for /x<p<x. 
We shall introduce a penultimate piece of notation, by defining the random 
variable 


+ f(P)Axjp- 
/X<pSx 
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Finally, let € be a Rademacher random variable that is independent of everything 
else. 

Now since the (f(p)) /z<p<x are symmetric random variables independent of B, 
and the A,/,, it follows that 


Sf) = YS > f@MAyp + Br Se Y> f@)Ayp + Be 


nsx J/X<pSx /x<psx 


d Been ene sao : or 
where = denotes equality in distribution. Then if we condition on the values of 
B,, C,, we find the conditional expectation 


Yi € ey f(P)Ax/p + B, By, Cy 
/x<psx 


= (1/2)|C, + By| + (1/2)| — Cy + By| = |Cil, 


by the triangle inequality. Now if we average over values of B,,C,, and use the 
Tower Property of conditional expectations (the fact that the expectation of a 
conditional expectation is the unconditional expectation), we obtain 


C,|. 


vy 


Yif@) 


nsx 


= Eye » f(P)Ax/p + By =F 
Jx<psx 


On recalling the definitions of C, and A,/,, we see we have proved the following: 


Lemma 2.2. For all large x we have 


YF) 


nsx 


>E| S) fo) dS fm). 


J/x<psx msx/p 


(In the Steinhaus case one has a weak inequality P(m) < p in the definition of 
B,, since f is totally multiplicative, but this makes no difference to the argument just 
given. Instead of choosing € to be a Rademacher random variable one can choose € 
to be uniformly distributed on the unit circle, and then one obtains exactly the same 
conclusion in Lemma 2.2.) 

Since the f(p) are Rademacher or Steinhaus random variables independent of the 
“coefficients” )-, aol (m) = Ay/p, an application of Khintchine’s inequality (see, 
e.g., Gut’s textbook [10]) yields that 
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1G 


2 
S* Fe) >> fi)| > Y> | SS fan) 


J/x<psx m<x/p J/x<p<x |mSx/p 


It would be nice if we could find a way to exploit this (sharp) bound with the squares 
still in place on the inside, but to prove Proposition 2.2 we shall trade them away in 
order to remove the intractable squareroot. Thus by the Cauchy—Schwarz inequality 
and the fact that Di jazeee 1/p = log 2 + o(1) we have 


S> |S fa) 


/x<p<x |mSx/p 


¥ p> f(m)| < 


J X<psx P \mx/p 


V/x<psx P 


2 


«| YS | fm 


/x<p<x |mSx/p 


Combining this with the above, we deduce: 


Lemma 2.3. For all large x we have 


i) 


nsx 


Ia logp 6 
> Se ee Y fim) > = DY FB =| S) fm). 


S<PSX msx/p J/x<psx m<x/p 


We have now almost finished the proof of Proposition 2.2. If we have two primes 
z<p<p' <z+2/log! x for some /x < z < x, then 


Al 


> fm)|-E|] 5° fan)|] < S> f(m) 


m<x/p m<x/p! x/p!<m<x/p 


< (@ ile oa +1 
ee sue oe es 
p p’ plo "x 


by the Cauchy—Schwarz inequality and orthogonality of the f(m). And we see 


1000 
logx Jr /P plog x 
Xx 1 lo x 
— 3 &P K JX 


log x logx’ 
ES ae /P g 
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which will make a negligible contribution in Proposition 2.2, so in Lemma 2.3 we 


may replace each term E |)_, Sind (m| by an averaged version 


=| S° fom)| dr. 


m<x/t 


log! 7 a 
P Pp 


Since we know that primes are well distributed in intervals of relative length 1 + 
1/ log! x (with density 1 when weighted by log p) we can rewrite Lemma 2.3 as 


Yif@) 


n<x 


vy 


1000 ,. D(1+1/ log! x) 
| 


1 log 
> 1 
logx » vee p 


: dt 
Xe, =) S° fim) Fi 


m<x/t 


x 


, dt 
E| Ds fem)! =. 


m<x/t 


logx J fz 


Proposition 2.2 now follows by making the substitution z = x/t in the integral. 


3 Lower Bounds for small Moments: Proof of Theorem 1.2 


The proof is a very simple argument using the Cauchy—Schwarz inequality and 
Holder’s inequality. 
Indeed, for any 0 < g < | we have 


s| reo] <8 [| Deel") -e[] reo) 7” 
n<N n<N n<N 
cal oF) a aff 


Since E| Dey f(m)|? < N and E| YD eyf@| = VN/(loglogN)3*?™, by re- 
arranging we obtain the lower bound 


[| Dre 


n<N 


2q 
| > N4(loglog N)~ ore), 
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4 Asymptotics for Even Moments: Proof of Theorem 1.3 


Note that 
: 2k : ee 
rf = Yo Eyer)... fowF ony. Fem) 
n<X 14 ,...,MkKSX 
M|,...,Nk<X 
= >. 1. (8) 
N14. SX 
M,...,Ng<X 
11... Nk=M] ...Mk 
Now 


a(n, vee, Mk, M,...-, mk) = Lait apentiant 


is a multiplicative function of several variables! and our problem reduces to 
understanding the mean value of 


= S] Sk , Wi Wk 
ny, : n m, wigs mM, 


nN nyny..n=n 7°" Ko mymz...mg=n 


is absolutely convergent for is;, tw; > 5 and moreover it factors as 


kok 
H(s1,....54. Wis... we | [] [Gi + wp) 


i=1j=1 


Tn other words 
g(m,..., Nk, M,..+, Mg) BU, «++ Uk, Vig Vk) = SCM UI, MEU, MV, «. ., MVE) 


for any natural numbers n;, m; and u;, v; whose least common multiples are coprime. 
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with H(s),...,5%,W1,---,Wx) absolutely convergent in the region Nts;, iw; > i. In 
addition a direct check shows that 


Moon bda T1034 LE) on 


P 


Therefore the main result of La Bretéche [4] is applicable with the k* linear forms 
LOD (51.0.0, Se. Wis ee We) = + w; with 1 < i,j < k. We note that the rank of 
the collection of linear forms £“) (inside the space of all C-linear forms on C*) is 
2k — 1. Therefore it follows from La Bretéche’s result that (8) is equal to 


(1 + 0(1))C,X* (log X)*~-Y), 


Using Théoréme 2 in La Bretéche’s work allows us to recover the precise value 
of C;. Indeed, according to Théoréme 2 in [4] we get that (8) is equal to 


(1 + o(1))H(5,.... 3) Vol(Ax(X)) 


where A;(X) is a subset of [1, oo) corresponding to tuples (a;;) € [1, oo) with 
1 <i,j < k such that 


foreachj <k: I] aij <X 


1<i<k 


and for eachi <k: I] aij <X 
l<j<k 


Therefore it remains to understand the asymptotic behavior of 
Vol(A;(X)) 


as X —> oo. Surprisingly, this is somewhat involved, and the rest of the proof is 
devoted to that. 


Proposition 4.3. Let k > 2 be fixed. Then, 
2k —2 
Vol(A,(X)) ~ (; oe - Vol By)» X* + (log X)\&—-” 
where Vol(&,) corresponds to the (k — 1)* dimensional volume of the Birkhoff 


polytope By, C R®, 


The proof of the Proposition depends on the following Lemma. 
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Lemma 4.4. Let n > 1 be fixed. Then as X — oo we have 


: 2n nN 
ts aa spctoax XP ( min(xi SP ee ns VE APs + yn)) dx .. dn ~ (?"\x . 


0<y1,....Yn<log X 
Proof. Making the substitutions v; = log X — x; and w; = log X — y; in Lemma 4.4, 
we See the integral there is the same as 
n —_— 
Xx Then 7 cli exp ( max(vj +...+0U,,wi+...+ wn) dv ...dWp. 


0<w1,....Wn<log X 


Here we can extend all the ranges of integration up to positive infinity, at the cost of 
a multiplicative error term 1 + o(1). Then by symmetry 


O<w},....Wn 


and making the further substitution v = v; + ...+ v, in the integral, we see the 
above is 


lo) 
= 2 | e” (/ dv... -dvy-1) (/ dw. dn) dv 
0 ut... bv;—1 <u wit+...-wy»Sv 
lee) 
= ah ey (/ dv .-diy-1) (/ dw, dn) dv. 
0 vyt...+v,—1<1 wi+...Fwy,<l 


Here the two integrals in brackets are simply the volume of the standard n — 1 
simplex and the standard n simplex, which are well known to be 1/(m — 1)! and 
1/n!, respectively. Therefore the above integral is equal to 


2 = 2n—1 2n 
—a | ey" dy =2 = i 
(n— 1)!n! Jo n n 


We conclude that the integral in the statement of Lemma 4.4 is equal to (as X —> oo), 


(1 + o()) ()« 


as claimed. 
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We are now ready to prove the Proposition, and thus finish the proof of 
Theorem 1.3. 


Proof (Proof of Proposition 4.3). Notice first that, if we set u;; = log a;;, and if we 
write 


for all i,j < k, then we find 


Vol(Ax(X)) = / 


(wip) <ij<kE 0,00)" ‘cj ,riSlog X Vij<k 


To prove the proposition we shall obtain upper and lower bounds for the integral on 
the right that are asymptotically equal. 

For convenience of writing, we start by introducing a little more notation. Let 
Set 2= ijet-l u;j;. Let also %,¢(X) be the set of uj; with i,j < k — 1 for which 


>» ujj < log X and =, ujj < log X and a ujj > (k-—2— 8) logXx. 


i<k-1 j<k-l ij<k-1 


Considering the vector u of u;; with i,j < k — 1 as fixed, let %,4(u, X) be the set of 
those uz; with i < k — 1 for which 


cj < logX forallj<k—-1 


Finally, again considering the u;; with i,j < k — 1 as fixed let Zp,(u, X) be the set 
of those uj, with j < k — 1 for which 


r, <logX foralli<k—-1. 


We set € = 1/./log X, say. First seeking an upper bound, we note that if we have 
Sp < (K-—2—€) log X then S; < (k—e) log X, and therefore the part of the integral 
where S,_-; < (kK —2 —€) log X contributes at most 


= _ 2 
xt “| Iduy, ... dug. < X*~* log® X. 
(uij)1<ij<kE[0,00)? icj,riSlog X Vij<k 


This is asymptotically negligible (for any fixed k) by our choice of €. Meanwhile, 
the part of the integral where S;_; > (k — 2 — €) log X is equal to 


/ exp(Sx-1) exp (4,1 a t4-1) x (9) 
Uk,e(X) Tex (u,X) 


xf exp (ua See a t-14) / exp(ux.t) du, ...duxx. 
TR (u,X) Uk, kiCk stk Slog X 
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Here the innermost integral is over those 
O < ue < log X — max(ug, +... + ug, Hee Eek), 


assuming the upper range of integration is at least zero. Therefore the innermost 
integral is certainly bounded above (extending the lower limit to negative infinity, 
and then performing the integration) by 


X exp ( —max(ug) +... + U1, Mig Fee + uk-14)): 


Substituting this in, it follows that (9) is less than 


exp ( min( Cis i) dujj. 
xf. = Fo ®*?( Ds a Il 


1<j<k-1 1<i<k-1 (i j)A (KA) 
(10) 


At this point we change variables, letting r|,...,7,—; and cy,...,c,—; run through 
the interval [0, log X] so that uj, = rj — vase ujj and uz j = Cj — Do <jcg—1 Mij- 
Since u;% = 0 and u,; = 0 this change of variable implies the additional condition 
that for all i,j < k—1, 


oz Ui STi and Ss Uujj SC; (11) 


j<k-1 i<k-1 


The Jacobian of this linear change of variable is equal to | since the linear 
transformation taking the (u;;) with (i,j) 4 (k, k) into (re, ce, uij) with i,j,€ < k—-1 
is upper triangular with only 1’s on the diagonal. 

Given r = (r|,..., 7-1) and ec = (cj,..., Cg—-1) we let Ue (¥, c, X) be the set of 
uj; With i,j < k — 1 satisfying the conditions (11) and the standing condition that 


Yo uy = k- 2-2) logX, (12) 
ij<k-1 
and we let F(X) be the set of 0 < 7,...,7%-1 < logX and 0 < cj,...,ch-1 < 


log X. Then (10) can be re-written as 
xf exp ( min( Yo Don )) Vo (%.e(. c, x)) [] aca 3) 
F(X) 1<j<k-1  1<i<k-1 i<k-1 


Since 7;,c; < log X for all i < k — 1, we have 


Vol(%.e(r, €, X)) < Vol(%.-(log X, log X, X)) 


= (log x)" . Vol(%,<(1, 1, e)) ~ (log X)""” - Vol(Yo(, 1, €)) 
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as X — oo, where logX := (logX,...,logX) and 1 := (1,...,1), and where 
we recall for the final asymptotic that ¢€ = 1/./logX. As already mentioned in 
the introduction Vol(%o(1,1,e)) = k-“-) Vol(A) where A, is the Birkhoff 
polytope. It follows that (10) is 


< (1 + o(1))X(log X)*""” - k- © Vol(B;) 


x J cx0 (min oe Cj; ba r)) I] dc;dr; 


1<j<k-1 1<i<k-1 i<k-1 


and by Lemma 4.4 this is less than or equal to 


2k —2 
(1 + 0(1))X(log X)"—* - "Vol Ay) XI! ( p= 


thus finishing the proof of the upper bound. 

For the lower bound we restrict attention, as we may (due to positivity), to the 
part of the integral where S,;_; > (k—2 +) logX and each 7;, c; is > (1 — €) log X. 
The point of the former condition is that if it is satisfied then 

Ure ture +... + Up-ik < (k— 1) log X — Spi < (1 — £) logx 


and similarly ug) + ug2 +... + Uee—1 < (1 — €) log X, and therefore 


log X — max (u +e FUR RH, Uk ee t-14) > elogX = JlogxX > ~. 


Therefore arguing as above the innermost integral over u;,, in (9) contributes 


(1 + o(1))X exp ( — max(ug1 +... + Ue k—1, Uk Fee + ux-1.4)): 


Proceeding as before we thus arrive to (13) but with the additional condition that 
(1 —«)logX < r;,c; < log X (and with the condition that re ujj = (kK-2—€) 
log X replaced by the condition that }°; j<k-1 Uij 2 (kK—2+ €) log X). It follows that 
on this set of 7; and c; we have 


Vol(%.e(r,¢,X)) > Vol(%.-((1 — €) log X, (1 — €) log X, X)) 
= (1 + o(1))(log X)*” -k- ©  Vol(F) 
Therefore we obtained the following lower bound 


(1 + o(1))X(log X)&—” - k-* Vol(By) 


«fff exp ( min( 2 Cj. > r)) I] de; dr; 
Tyee(X) 


1<j<k-1 1<i<k-1 i<k-1 
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where Tue(X) is the set of r;,c; satisfying (1 — e)logX < r;,c; < logX for all 
i < k—1. Note that the condition (1 — ¢) log X < r;, c; can be dropped. Indeed the 
contribution to the integral of any tuple of (r|,...,74%—1) or (c1,...,Ck—1) where at 
least one of the c;,7; is < (1 — €) logX is < X*-!-¢ and therefore negligible. Thus 
we can extend the integration to all of c;,7; < log X. Because of this Lemma 4.4 is 
applicable and we have therefore obtained the lower bound 


2k —2 


= (lp acy pea -k-*VVol(B,)X* - (log X)*&-»” 


as claimed. Since we have obtained asymptotically matching upper and lower 
bounds the proof of the proposition is finished. 


5 Proof of Theorem 1.4 


In the Rademacher case we have, letting LD) denote a generic square, 


(rm) = YO Bpln)...feo] 


n<X NY eee MX 
2 2 
= YE wa)... w) 
Ny ye. MSX 
Ny... N= 
Let g(m,...,m) be a multiplicative function of several variables, supported on 
square-free n;, and such that g = 1 when m,...n, = OU and g = O otherwise. 


Then we find that the Dirichlet series 


is equal to 


(+ > aarran): 


This factors as 


A(s1,..., 8x) I] (si + sj) 


l<i<j<k 
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with 


HG. D= TTA "4 > w) 
Pp 


1<j<k/2 


The main result of La Bretéche is applicable with (5) linear forms 0“)(s,, 59, 
..+,8%) = 5; + 8; defined for 1 < i < j < k. The rank of these linear forms is 
equal to k for k > 3 (for k = 2 the rank is equal to | since there is only one form 
in that case). Therefore applying La Bretéche’s result it follows that the moment is 
asymptotically 


(1 + 0(1))C.X*/2(log x) O-*. 


In order to determine the constant C;, one could use Théoréme 2 of La Bretéche, to 
conclude that the moment is asymptotically 


(1 + o(1))H(G,..., 5) Vol(B(X)) 


where B(X) is the set of (u;j)i<j € R&“~)/? such that 


forall <i<k: [us] [us <X 


ji i<j 


and then proceed in a manner similar to Theorem 1.3. However we leave this 
computation to the interested reader. 
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Large Values of the Zeta-Function 
on the Critical Line 


Aleksandar Ivié 


Dedicated to Professor Helmut Maier on the occasion 
of his 60th birthday 


Abstract This is primarily an overview article (Lecture given during the conference 
“Number Theory and its Applications Workshop” in Xi’an (China), October 23-28, 
2014.) dealing with the large values of | G + it)|. This approach allows one to 
obtain upper bounds for moments (mean values) of |¢ G + it)|, which is one of 
the fundamental problems of the theory of the Riemann zeta-function. A sketch of 
the upper bound for the 12th moment of D.R. Heath-Brown (Q J Math (Oxford) 
29:443-462, 1978) is presented, together with some recent results of the author. 
They include upper bounds obtained by the use of large values of E* (7), a function 
closely related to the classical function E(T), the error term in the mean square 
formula for | GG + it)|. A new large values result involving E£;(T), the general error- 
term function in the formula for the 2k-th moment of | (4 + it)|, is also given. 


Keywords Riemann zeta-function * Moments ¢ Large values 
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1 The Basic Properties of ¢(s) 


1.1 The Definitions 


The Riemann zeta-function is defined by 


co 


c(s)= on? =[[G-p°y" (1) 
P 


n=1 
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for ts > 1, where p denotes primes. For other values of the complex variable 
Ss =o + it(o,t € R) it is defined by analytic continuation. It is regular for s € C, 
except at s = 1 where it has a simple pole with residue 1. The product representation 
in (1) shows that €(s) does not vanish for o > 1. The Laurent expansion of €(s) at 
s = | reads 


1 
f(s) = jt rtuwse-Y)+vle-'+--, 


where the so-called Stieltjes constants y, are given by 
k+1 yy 


_ CDS ‘ log<m log - 
w= fm | eed (k=0,1,2,...), 


m<N 


and y = yo = —I''(1) = 0.5772157... is the Euler constant. For more properties 
of ¢(s) the reader should consult the monographs of Titchmarsh [47], Edwards [9], 
and the author [20]. 

A notable feature of ¢(s), whose analogues are true for many Dirichlet series (or 
L-functions) of the form 


CO 


F(s)= Caryn (Rs > 1), (2) 


n=1 


is the functional equation, proved first by Riemann [43] in 1859. In a symmetric 
form it says, that for s € C, 


x t(9)FGs) = x O61 — GU —s)). (3) 
Alternatively we can write (3) as 
G(s) = x(s)S—s), 
where 


= rods) s-1/2 
X(s) := ras) uA 


’ 


and J"(s) is the familiar gamma-function. Using the classical Stirling formula for 
I'(s), we have 


an \ ott 2 1 
x(s) — (=) eit 1/4) (: +O (<)) (s=o+it, t> 2). 
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1.2 The Selberg Class 


The best known generalization of ¢(s) is the Selberg class .Y of Dirichlet series (see 
the overview papers of Bombieri [2] and Kaczorowski [36]), defined axiomatically 
by Selberg [45] as follows: 


(a) The series in (2) converges absolutely foro = fis > 1 with ar(1) = 1, and the 
coefficients a(n) satisfy the bound a(n) «, n®° for every ¢ > 0. 

(b) There is a natural number mp such that (s — 1)" F(s), which is of a finite order, 
admits analytic continuation which is regular in C. 

(c) There is a function 


@r(s):= QT p(s)F(s) (Qr > 0) 
with 


Tr) = [[ (Ms + u(F)) A/F) > 0, Ruy) > 0). 


j=l 
Then the functional equation 
Pp(s) = orOr(1—s) — (Jr| = 1) (4) 


holds for s € C, where for any function f(s) we define f(s) = f(S). 
(d) We have 


log F(s) = )° br(n) A(n)/(n' logn), 


n=2 


where br(n) « n° for some 0 < 6 < 5 and A(n) is the von Mangoldt function, 


namely A(n) = logp ifn = p* (a € N) and A(n) = 0 otherwise. 
The quantity 


dp = 2) A,(F) 


j=l 


plays an important réle in the theory of .7. It is called the degree of F in 7, it does 
not depend on the form of the functional equation (4), and it is conjectured that it 
is always a non-negative integer. Broadly speaking, anything that holds for ¢(s) can 
be to some extent generalized to elements of .”. 


Notation. Owing to the nature of this text, absolute consistency in notation 
could not be attained, although whenever possible standard notation is used. 
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By N, Z,R,C we denote the set of natural numbers, integers, real and complex 
numbers, respectively. The symbol ¢ will denote arbitrarily small positive numbers, 
not necessarily the same ones at each occurrence. The symbols f(x) = O(g(x)) 
and f(x) < g(x) both mean that |f()| < Cg(x) for some constants C > 0 and 
x > xo > 0. By f(x) “ay... g(x) we mean that the constant implied by the <- 
symbol depends on a, b,..., while a =< b means that botha « band b « ahold. 


2 Moments on the 1/2-Line 


2.1 Basic Concepts 


In zeta-function theory, of particular interest are the moments (or mean values) 


T 
w(t) = fgg + in Pat 


Although the integral certainly makes sense if ik > 0, usually one takes k €¢ N 
and supposes that k is fixed. Namely in this case one can use the obvious identity 
|z|? = z-z, and if one has a “good” expression for z = € ks +it), then by multiplying 
the expressions for z and Z one can tackle /,(T), at least in principle. There are two 
monographs dedicated solely to the subject of mean values of f(s): the author’s 
[18] and Ramachandra’s [42]; see also the author’s recent overview paper [29] and 
Joyner [33]. 
The expression 


T+H 
I(T + H)-I,(T) = / IS + in|" dt 
r 


denotes zeta-moments in “short intervals” if 1 < H < T and H = o(T) as T > oo. 
It is also an important object of study. 
We have (Balasubramanian—Ramachandra; see [42]) 


ttc 


Ic(5 + yl < toer( | 


tc. 


Io(4 + ir)|K dr + i) (KEN, c>0). (5) 


This bound allows one to transform discrete zeta sums into sums of integrals over 
short intervals. In this process one loses only an (unimportant) logarithmic factor. 

Conversely, one trivially can transform integrals of continuous functions f(x) into 
discrete sums by means of the elementary inequality 


i * FO) dx 


< (b-a) se If (x)| (a <b). 
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2.2 Lower Bounds for é + it) in Short Intervals 


Ramachandra [42] showed unconditionally that one has 
T+H P 
/ I¢(5 + in)" dt >, H(log Hy (KEN, loglogT<H<T). (6) 
T 
He used complex integration and the fact that, for w = u+iv (u,v € R; |u| < 2/2), 


| exp(— cos w)| = lexp(—3(e" + e™)) | 
— lexp(—H(e"e™ + eve'))| = exp(— cos u: cosh v). 


The above kernel exp(— cos w), which decreases like a second order exponential 
in |v|, sets the limit to the lower bound for H in (6) (as a multiple of log log 7). 
According to W.K. Hayman, this is essentially the best such kernel function, but it 
is an open problem whether, by some method different from Ramachandra’s, one 
can improve the range for H in (6). 

Recently Radziwill and Soundararajan [41] showed that (unconditionally) 


y 
i [C4 + in)?! dt > e" Tlog T)* 
0 


holds for any real k > 1 and T = Tp. This bound not only holds for any k > 1, but 
it is explicit and at the same time continuous in k, although the constant e 30k" ig 
certainly not the best one possible. 


2.3 Upper Bounds for és + it) in Short Intervals 
Under the RH 


Upper bounds for /,(T) are much more difficult to obtain than lower bounds. In 
2010 Soundararajan [46] showed that, under the Riemann Hypothesis (RH, that all 
complex zeros of ¢(s) have real parts 1/2), 


T 
/ [eC + ink at «K, Tog Tt, (7) 
0 


which is valid for any fixed k > 0 and any given e > 0. This result, when H = T, 
comes very close to Ramachandra’s unconditional lower bound (6). 

The author [25] obtained an improvement and generalization of (7), using 
Soundararajan’s method: let H = T°? where 0 < @ < 1 is a fixed number, and 
let k be a fixed positive number. Then, again under the RH, we have 
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T+H 
, Ic(5+in/*dt « H(log TP (1+ 00/1083.) 


T-H 
where 
log, T := log log log T. 


Later in 2013 Harper [11] obtained the conjectured upper bound (also under the 
RH) for any fixed k > 0, namely 


T 
[eg tinrtar < roog7y. (8) 
0) 


Harper’s method can be carried over to obtain the corresponding improvement for 
I,(T + H) — I(T) as well. Essentially this result is best possible, since for H = T 
Ramachandra’s lower bound (6) (which is unconditional) for i,(7) is of the same 
form as the right-hand side of (8). 


3 The Conjectural Formula for /;,(T) 


It is generally conjectured that 
[,(T) = Tp,2dog T) + E;(T), (9) 


where p,2(y) is an (explicit) polynomial in y whose coefficients depend on k. The 
function E;,(7) is thought of as an error term function, in the sense that one expects, 
for fixed k € N, 


E,(T) = o(T) (T > ov). 


Unfortunately, so far an asymptotic formula of the above shape is known, despite 
many efforts, to hold only in the cases when k = 1 ork = 2. The case k = 1 
(namely, the function E)(T) = E(T)) is extensively discussed in [18, 20], while for 
k = 2 (the function E2(7)) see [30, 31, 40]. 

There is, however, a conjectural formula for I,(T) which of a similar shape as (9). 
The formula in question is due to Conrey, Farmer, Keating, Rubinstein and Snaith 
(7, 8, 37]. It is of the form 


t 


T T 
nr) = f 6 + inf ar = +00) [ P, (los(5— 


)) dt (IT > ov). 
(10) 
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In (10) P(x) is the polynomial of degree k?, all of whose coefficients depend on the 
(fixed) integer k > 1, given explicitly by the 2k-fold residue 


Pi(x) oy / / m 
y= ee oe 
‘ ki? (Qx1)* lal=e1 cox |= e2% 


G(z1,-- ++ ZK) A?(Z1,.- + Zax) 
. He ak exp{} sé = zita)} dz... dz, 
i=1%7 i=1 


where the ¢;’s are small positive numbers. We have 


A(Z1,.++52m) = I] (Zz -Zy= le lates 


l<i<j<m 


which is the Vandermonde determinant, 


k ek 
Gzi..-- 22x) = Ae(zis.- 220) TT] $0 + - a0) 


i=1j=1 


and finally A; is the Euler product (as usual, e(@) := exp(27i0)) 


Ax(Z1, o- ., 22k) = = I] 7 d—- ptt) x 


Pp ij=l 
« [T (1 = e(@)p- i) "(1 e(-ayp 24) a8. 


The authors actually conjecture, in all cases, an error term in (10) of the order 
Ox..(T'/***), which in the general case this author finds to be too optimistic. 
In fact their paper [8] brings forth a conjecture (via characteristic polynomials 
from Random Matrix Theory; see, e.g., the work [39] of M.L. Mehta on random 
matrices) for the complete main term in the asymptotic formulas for a wide class 
of L-functions. The coefficients of P;(x) are given explicitly when 2 < k < 7, and 
numerically computed moments are compared to the values obtained for the main 
term by the moments conjecture. We mention that further numerical calculations 
involving P,(x) were carried out by Hiary and Odlyzko [13], and Rubinstein 
and Yamagishi [44]. In all cases when an asymptotic formula for the moment 
was rigorously proved, the main term in question coincided with the expression 
predicted by the authors, which renders the conjectural formulas quite important. 
Heretofore there have been several conjectures for particular L-functions, to mention 
here just the works of Keating and Snaith [37], and Conrey and Gonek [6] on the 
conjectural formula for i,(T). 
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4 The Large Values Technique for Zeta-Sums 


4.1 Introduction 


There are several approaches to the study of (unconditional) upper bounds for /,(T) 
(or more generally ,(T + H) — i,(T)). We single out the following: 


1. 


Ss 


Large values techniques through Dirichlet polynomials of the type >> f(n)n7 
=N 


and things like the Haldsz-Montgomery inequality (see, e.g., the Appendix 
of [20]). 


. Spectral theory/Kloostermania. This is the approach developed by N.V. 


Kuznetsov, Deshouillers-Iwaniec, R. Bruggeman, Y. Motohashi (see his 
monograph [40]), and others. It leads to good results on /2(T), namely produces 
a sharp formula for the (weighted) fourth moment of Ic + it)|, from which 
results on E2(7T) can be derived. 


. The general binary additive divisor problem, involving exponential sums with 


d,(n)d,(n + h), where the shift parameter h is not fixed, and the general divisor 
function d,(n) is generated by ¢*(s). For the case k = 3, interesting in the case 
of the estimation of J;(T), see the author’s work [19]. 


. Connecting bounds for /,(7) to bounds involving E(T) = E\(T) and the related 


function E*(T). This function will be defined in Sect.7 and used in bounding 
I,(T) in Sect. 8. 


The purpose of this paper is to focus on the approaches 1. and 4. In particular, 


a sketch of the proof of Heath-Brown’s classical estimate (11) (see [12]) will be 
given, and this result (up to a log-factor) will be reproved in Sect. 9. 


4.2 The Large Values Technique 


Suppose t, (r = 1,...,R) is a set of points lying in [T, T + H] (1 < H < T) such 
that 


ICG +it)|>V. |t--tl>G(r#5, 1K G< Hi rs=1,...,R). 


The points ¢, are then said to be well-spaced. 


A good upper bound for R will lead to a good upper bound for 


ye at, 


r<R 
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hence also to a good upper bound for 


T+H 
/ eh + it)|Aar, 
T 


where k > 0 is a suitable constant. 

This is the underlying idea of the large values technique, which obviously can 
be used for the estimation of moments of other L-functions, in particular to those 
belonging to the Selberg class .7. 


5 Atkinson’s Formula for the Mean Square 


The formula of Atkinson (1916-2002) [1] for the mean square of | (3 + it)| (1949) 
is a fundamental result in zeta-function theory. It was used later in 1978 by Heath- 
Brown [12], who employed a large values technique, to prove the 12th moment 
estimate 


T 
/ Ic(5+in| dt «< T’(logT)!”. (11) 
0 


This result is even today essentially the strongest upper bound for /,(T) when k = 6. 
Let 


T 
E(T) = | e+ infPar— 7 (og() + 27-1). 


Here d(n) is the number of divisors of n, and y = —I’’(1) = 0.577215... is Euler’s 
constant. The function E(T) is a fundamental function in zeta-function theory, as 
(7) = fe Ie(5 + it)|? dt is the fundamental moment of le(s + it)|. Atkinson’s 
explicit formula for E(T) is 


E(T) = D\(T) + 2(T) + O(log’ 7), 


B(1) = 27 (7/(2n))'* SY (-1)"d(n)n/*e(T, n) cos(f (Tn), 


n<N 


= a) a ey Pe 
X(T) := 2 ni (log T/(Qnn)) COs (ri0e(—) T+ :*) : 


f(T,n) = 2Tar sinh (/n/(2T)) + V2anT + nn? — in 
= —in + 2V20nT+ 


+ LV Onin? TM? an OT OP sy OT OF eas 5 


180 A. Ivié 


e(T,n) = (1+ ren] (2T))-"44 (27 /en)"7ar sinh (vmnj@ry)\ 
= 1+ O(n/T) (d<n<tT), 


T N N2 NT 
ar sinh x = log (x + 1+2#), N= he sa + : 
2x 2 4 20 


The sum >°,(7) in Atkinson’s formula is analogous to the sum approximating 
€7(s) (essentially sums of d(n)n~ of length x |t\|; see, e.g., Chap. 4 of [20]), and 
by averaging processes (e.g., Gaussian weights exp(—(7/G)7)) its contribution in 
many instances may be made fairly small. Its strength lies in the facts that it is 
explicit, contains two sums with elementary functions and the error term is small. 


6 The 12th Moment Estimate 


6.1 Technical Preparation 


We sketch now the proof of (11), namely the following 
Theorem 6.1. We have 


T 
/ Ie(5 + in| dt « T*(logT)". 
0 
First we have, for T® < G < T!/?-*, L := logT, 


T+G GL 
/ (5 +in? dt < | [¢(5 + iT + it)’ exp(-P°'G”) dt. 
T-G —GL 


The presence of the exponential factor makes it possible, by using Atkinson’s 
formula and the classical integral 


/ ~ exp(Ax — Bx?) dx = (2°/B)'/? exp(A?/(4B)) — (RB > 0), 


Co 
to obtain the following 


Lemma 6.1. 
T+G 
/ Ic + in|? dt < GlogT 
T— 


. 2 
+ GV(TKY"(|S(K)| +i f IS] dx) e-P4I7, 


K 0 
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Here 


S(x) = S(Q.K.T):= YP (-1)"d(n) explif(7.n)), 


K<n<K+x 


f(T, n) = 2Tar sinh (/xn/(2T)) + V2nnT + 7?n2 — iT, 
and summation is over K = 2* such that T!/3 < K <N , where for 5 > 0 fixed 


N := B?/(T/(x)—B), B:= T(2nG)"'log!**T. 


6.2 Reduction to Exponential Sums 


Very often problems in analytic number theory are ultimately reduced to the 
estimation of exponential sums, namely classical van der Corput sums, sums over 
primes, sums over Hecke series, Kloosterman sums, etc. 

By means of Lemma 6.1 we shall prove first (see [20]) 


Theorem 6.2. Let («,4) be any exponent pair with k > 0 and let ty < --- < tr 


satisfy 

lel <7, [CG + it)| = V>0(r <7), | -4]/ 210° #5). 
Then 

R<TV log? Fe TOMY Ate (log TPT TO, 


Remark 6.1. Recall that («, A) is an (one-dimensional) exponent pair if 


S> exp(if(n)) K MF* (0<F <1), 


N<n<N’<2N 


if f(x) (€ R) is sufficiently many times differentiable in [V, 2N](N > 2), and 
essentially 


f©@) =<, FNO* = (kK =1,2,...). 


For a detail account on exponent pairs the reader is referred to the book of Graham 
and Kolesnik [10]. 


Thus trivially (0, 1) is an exponent pair, other common ones are 


(1/2, 1/2), (1/6,4/6), (2/7,4/7), (1/14, 11/14) 


182 A. Ivié 


etc. Taking (k,A) = (1/2, 1/2) in Theorem 6.2 we obtain 
R <« T’V-" log'®T, 
and Theorem 6.1 easily follows. 
Theorem 6.2 follows from 
Lemma 6.2. Let of be a set of numbers {t,} such that T/2 < t, < T, log? T < G< 


|t, —t;| < J forr # s. With 


S(x) = S(%,K,T):= > (-1)"d(n) exp(if(T,n), 


K<n<K+x 
f(T, n) = 2Tarsinh (/an/(2T)) + V2nnT + 1?n? — Gn, 


we have 


S(x, K,t,)| << K+ K47'4G 2 Jog!/? Tl. |!/2 
g 
treo 


f Se a A oa a log?/? T. 


The result is obtained by the use of the Hal4sz—Montgomery inequality (see, e.g., 
the Appendix of [20]) 


1/2 
YI. ¢)1 < Ell § (6,601) (12) 


rsR r.s<R 
For Dirichlet polynomials (12) is used in the following context: If a = {a,}° , and 


b = {b,}°2 | are two (vector) sequences in C, then their standard inner product is 


CO 


(a,b) := J > dnbn. (13) 


n=1 


This is used to remove the coefficients (—1)"d(n) from the sums of S(x, K, t,) 


and get ordinary exponential sums. Hence, withE = > (—1)"d(n), we have 
K<n<K+x 


glP=EH= Yo Pn) «K Klog’T 
K<n<K+x 
and by (12) and (13) the left-hand side of the sum in Lemma 6.2 is 
1/2 


= exp(if(t, n) — if (ts, ») 


K<n<2K 


<< K'/? \og3/? T > 


tr tsEDA 
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The terms for r = s are trivially O(K), and the terms r # s are estimated by the 
theory of exponent pairs. Lemma 6.2 follows. 

The term TV~° in Theorem 6.2, which would lead to the yet unknown sixth 
moment 


T 
i Io(5 + in|° dt « Tlog® T, 
0 


comes from the term with |.c7|!/* in Lemma 6.2. 


The other term is small if J is suitably chosen, namely it holds for Ro, the number 
of t,’s lying in an interval of length < J. Thus the obtained bound for R has to be 
multiplied by Ro(1 + T/J), and this gives the other term in Theorem 6.2. Up to 
the choice of exponent pairs, this is the limit of the method. Note that there are 
new (non-classical) exponent pairs, obtained by Huxley [14—16] and Huxley and 
Ivié [17], for example. Huxley’s work is built on the Bombieri—Iwaniec method for 
the estimation of exponential sums; see [3, 4]. There is a way to avoid the use of 
the Halasz—Montgomery inequality in proving (11) and work only with the simpler 
Cauchy—Schwarz inequality. For this, see Chap. 8 of the author’s monograph [20]. 


7 Large Values of |¢ G + it)| via the Divisor 
Problem and E*(T) 


7.1 The Function E*(T) 


Already Atkinson’s formula provides the analogy between E(7) and the error term 
in the classical Dirichlet divisor problem, namely the estimation of the function 


A(x) := Y/d(n) — x(logx + 2y — 1). (14) 
We have 
1 1 3 
A(x) = x4 S d(n)n—* cos(4 /nx — 121) 
a2 » , (15) 


+ O-(x2t®N-2) (1K NX2), 


which is the classical truncated Voronoi formula for A(x) of 1904 (see Voronot [48] 
and Chap. 3 of [20]). 
A better analogy is to consider, instead of A(x) (see Jutila [34, 35]), the function 


A*(x) : 


II 


—A(x) + 2A(2x) — 5 A(x) 


; Y > (-1)"d(n) — x(logx + 2y — 1), 


nZ4x 
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because Atkinson’s formula has a factor (—1)” in it, and the formula for A(x) does 
not. Then it makes sense to investigate the function 


E*() = E()- 20 A*(=-). 


Namely, we have the explicit formula 


1 


A* (x) = aa S\(-D"d(n)n-4 cos(47 ./nx — im) 


aV2 few (16) 
+0,(x2+*N-2) (KN Kx), 
and without (—1)” the formula (16) is exactly Voronoi truncated formula (14) for 
A(x). 
If N = T!/3-*, then the first N terms in the truncated Voronoi formula for A* (x) 


and }* (7) in Atkinson’s formula are asymptotic to one another. This is the analogy 
between A*(x) and E(T). 


7.2 Results on E*(T) 
Jutila [35] proved: 
T+H 
/ (E*(t))° dt «<, HT log T+ T'** (17) 
T 
for | < H = H(T) < T.The significance of (17) is that, in view of 
T T 
/ (A*(t))? dt ~ AT??, / E’(t)dt ~ BT?/?, 
0 0 


for A,B > 0,T — on, the function E*(f) is in the mean square sense of a 
lower order of magnitude than either A*(t) or E(t). The author [26] in 2012 
complemented (17) with the lower bound 


T+H 
/ (E* (t))° dt > AT"? log’ T (18) 
T 


for T73+* < H = H(T) <T. 


Remark 7.2. The bounds (17) and (18) show the true order of the integral. Problem: 
What is the lower bound for H = H(T) for which (18) holds? The author [23] 
proved in 2007 the asymptotic formula 
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T 
i (E*(t))* dt = T*/P3(log T) + O,(T/*t®), 
0 


where P3(x) is a cubic polynomial in x with positive leading coefficient, and all 
the coefficients may be written down explicitly. It is reasonable to conjecture that 
the above error term is O,(T!**) and at the same time (2(7), but this is certainly a 
difficult problem. However the above formula does imply neither the upper nor the 
lower bound for (E*(t))* in short intervals. The author also proved [21, 22] 


T 
i lE*(Par «<, 78/2 (19) 
0 
and that 
: fo} 
i JE*(har «, 72+", (20) 
0 


Hence by the Cauchy—Schwarz inequality for integrals it follows that 
“ 4 
/ ler(@|*ar <<, TI, (21) 
0 


An open problem is to investigate higher moments of E*(t). In particular, it would 
be interesting to show that the integral of (E*(t))° is of a lower order of magnitude 
than that of |E*(1)|>. 


Theorem 7.3. Suppose that the bound 
T 
/ (E*())’ dt «, 73/27 
0 


holds for some constant n such that 0 < n < 1/6. Then for any € > 0 there exist 
constants To(e),A > 0 such that, for T > To(e), every interval [T, T + T!"**] 
contains points T,, T2 for which 


E*(Ti) > AT lor? 7, E*() <—AT) log"? T, 
Remark 7.3. This is a recent result of the author [25]. Note that, for A > 2, one has 
T 
i |E*(t)|4 dt >A T'*4/6 (log pe, 
0 


Remark 7.4. We have 


ij 
E(T) = 2nA*(——) + Q(T" log?” 7). (22) 
IU 
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A plausible conjecture is that 
* T 1/4+¢ 
E(T) = 2nA .) +O (T ). (23) 
I 


This is compatible with the classical conjectures (implied neither by the Lindel6éf 
Hypothesis (that ¢ G + it) <, |¢|*) or Riemann Hypothesis!) 


AC ew ie al Moye, (24) 


but perhaps, in view of (22), the true exponent in (23) is even 1/6 + ¢. 


Remark 7.5. Lau and Tsang [38] showed that A(x) «, x“t® implies A*(x) <, 
x“t® and conversely, A*(x) «, x?t® gives A(x) <, x°**. In the other direction, it 
is known that a < 1/4 and b < 1/4 cannot hold. 


8 Connections of E(T) and E*(T) with the Moments 
of |5(4 + i0)| 


8.1 Large Values Estimates 


There are several ways to connect power moments of | (3 + it)| with the moments 
of E(T) and/or E*(T). The author [25] proved 


Theorem 8.3. Let t),...,tr be points in [T, 2T| which satisfy 
td +i)|2Vver 


and |t, —t;| = 1 forr,s < Randr # s. Then we have, for L = log T, G = A(V/L)? 
with a suitable constant A > 0, and k € N fixed, 

3T 

RK Vr2eky2t2k / 


T/3 


|E(t + 2G) — E(t — 2G)|* 


+ |E(t+ $G) — E@— 5G)|*> dr. 


Remark 8.6. The presence of the factors 
|E(t + 2G) — E(t— 2G)|*, |E(t + $G) — E@— 5G)|' 


makes one feel that it is possible to exploit the “closeness” of the arguments. Indeed, 
Theorem 8.3 is valid with E* in place of E. 
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Corollary 8.1. Suppose that the integral on the right-hand side in Theorem 8.3 is 
bounded by O,(T!+*+®G®) for some real constants a = a(k) (> 0) and B = 
B(k) <k—1, andT® < G= G(T) « T"”?. Then we have 


T 
/ [eG + Pre? dt a. Pitate 
0 


We also have the following result (see [24]): 
Theorem 8.4. For T?/2 < G= G(T) « T'/? we have 


2T 

4 
[ (c+ G) - E(t 6) dP, Te, 
T 


and the same result holds with A instead of E. 


Corollary 8.2. As shown in [24], if the above bound holds for T’ < G < T'/?, then 
the weak 8th moment 


T 
: Ich +inledt «, Tit 
0 


holds. 


Remark 8.7. The above bound holds for E*(7) in place of E(T) in view of the 
fourth moment bound for E* (7). 


A recent result of the author [28] is the following: 
Theorem 8.5. Let t\,..., tp be points in [T, 2T] which satisfy 
IfG + it)| = V2 T° 
and |t, — t;| > 1 forr,s < Randr # s. Then we have 


3T 
R«<V**logT i: |E* (¢)|* de. 


T/3 


From the results on moments of E* (7) [see (17)-(19)] we obtain, with k = 3, 4, 


Corollary 8.3. 


E T 
/ ICG + in| dt «, T3*, / [o(4 + it)|!o dt <<, TAT, 
0 0 


We also have, with k = 5, 
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Corollary 8.4. 


T 
/ Io(4 + it)| dt «, T°**. 
0 


The bounds in Corollaries 8.3 and 8.4 are, up to “e’’, the sharpest ones known. 
D.R. Heath-Brown [see (11)], obtained T? log!” T as the bound for the last integral. 
Corollary 8.4 thus provides an alternate proof of Heath-Brown’s result. 

Theorem 8.3 on large values follows by using (C > 0,L = logT,t = T) 


t+G Cc roe ek 
i; I¢(5 + iu)|? du < al xE* (t+ xe! dx + CGL’ (25) 
1-G G —GL 
and the fact that 
A*(x + H)— A*(x) <«, Hx’ (1 «KA <x), 
which is a consequence of the elementary bound 


d(n) <, n® 


and the defining relation 


A*(e) = 3 >> C1)"dQn) — x(logx + 2y - 1). 


nS4x 
Conjecture 8.1 (M. Jutila). We have 
A*(x+H)— A*(Q) «, VHE (KH <2). 
This conjecture is motivated by mean value results for A(x + H) — A(x) and 


A* (x+H)—A* (x) (see [25]). Recently Wenguang Zhai and the author [32] proposed 
a similar 


Conjecture 8.2. Suppose that 
logT <U<T'?/10, 17? <H<«T, HU» T'*. 


Then the estimate 


T+H 
max 
T O<u<U 


holds for some absolute constant c > 0. 


2 
A(x + u) — A(x)| dx « HU log’ T 
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This is a very strong conjecture, since it implies 


Corollary 8.5. If Conjecture 8.2 holds, then A(x) «, x!/4*®, and (by Remark 7.5) 
also A* (x) «, xi/4t8, 


From (25) we have the following 
Corollary 8.6. If E*(T) <. T°t® (0 < @ < 1/3), then 


ca +i <«, [@?t. 


Thus if the conjectural E*(T) «, T!/°+* holds, then we have (the very strong 
bound) 


CA +it) Ke [ere 


Let us mention that the best unconditional exponent for |¢ G + if)| is 32/205 = 
0.15609..., which is a result of Huxley [16] (very recently Bourgain [5] in 2014 
improved Huxley’s exponent to 53/342 = 0.15497...). The yet unproved Lindelof 
hypothesis states that &G + it) <«, |t|*. It follows from the RH (see, e.g., Chap. 1 
of [20]), but the converse is unknown (and not very likely!). 


9 A New Large Values Result Involving the Function E;(T) 


Let E,(T) be the error-term function defined by (9). We present a new result which 
generalizes Theorem 8.5. This is 


Theorem 9.6. Let E,(T) be is as in (9). Then, for a fixed m (= 1) and L = logT, 
we have 


2T 2T 
/ Ie oe 7 al dt <, Tite a pante+1 / |E;(t) |” dt. (26) 
T T/2 


Moreover, when k = 1, (26) remains valid with E\(t) = E(T) replaced by E*(t). 


Theorem 9.6 is a consequence of the following large values estimate. Let 
ti,..., te be points in [T, 27] which satisfy 


If(5 + it,)|>V2T (27) 
and |t, — t,| > 1 forr,s < Randr# s. Then we have 


3T 
R < yr 2knt l) 7 amie / |Ey(t)|" dt. (28) 


T/3 
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Namely we divide the integral on the left-hand side of (26) into O(L) integrals of 
the form 


2T 
/ Ck + in Pert dt << RyV2kert) 
T,.VS|¢(5 +i)|<2V 


where Ry is the number of points t,,y each of which satisfies (27) (with t, = t,.v) 
and |t,y — tsv| > 1 for ry,sy < Ry. The points t,,y are chosen in such a way 
that |¢ (G5 + it)| attains its maximum value in suitable subintervals of length not 
exceeding unity, lying in [T, 27]. Applying (28) to bound Ry one obtains (26), since 
the contribution of V which are < T° is trivially O(T'**). 

To prove (26) we start from (5) to obtain 


YileG + ink « (of [ “(6G + in Pde + i) (29) 


r<R r<R 


The interval [T, 27] containing the points {t,} is covered with those intervals of 
length 4GL (T’ < G < T'/) (except the last which may be shorter) which have 
a non-empty intersection with at least one interval of the form [t, — 7 tp + 5]. The 
centers of such intervals are denoted by #/,, and we suppose that there are R’ (< R) 
such intervals. Moreover, by considering separately the points {#/,} with even and 
odd indices r’, we are led to the estimation of the sum 


t,+2GL 
> 2 (4 + it)|* de, (30) 
<R! 
where |t/, — t,| = 4GLif r’ # s’. Now we recall (9) and set Q2 := py2 + pi. Since 
(H = 2GL) 


tp +H 6 a 
/ Io + in de < ef ECE + it, + ix) Peo dx 


-—H ss 
. 2 
=F 7 [+ Oe(t +x) + E(t +)] eo dx, 
—oo 
it follows that the expression in (30) is 


[o,) 
< RG log” T+ = cf xE;(t. + el dy 


1 <R’ 


& R'G log” THE ->f |E,(t. + x)| dx 


1 <R’ 


") +2GL 


= R'Glog* r+ = 3 [ |E,(t)| dt. 


wep ot —2GL 
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Since the intervals (t,,-2GL, t!, + 2GL) are disjoint we obtain, on applying Hélder’s 
inequality (m > | is fixed), that the last sum above does not exceed 


1-1/m /,42GL my 1/m 
Y<R! <P! ',—2GL 


#', +2GL ¢,+2GL ms 
FO) ams ay) lec" ar{ f 1dt 
’,—2GL t 


r<pR! 4 ”,—2GL 


1/m 


1/m 


/\1—1/m oe m 1-1/m 
<& (R’) i IE(p)\" dt) (GL), 
T/2 


If (27) holds, then the sum in (29) is >> RV". Therefore we obtain 


1/m 
LB 5T/2 
Rv2* < RGU tH +4 (Ry / |Ex(t)|"" dt (Ly, 
G T/2 
Now choose 
G = cv¥L-#! (31) 


with a suitable c > 0, so that R’'GL" +! is dominated by RV“. Simplifying the above 
bound and taking into account that R’ < R, we have 


5T/2 
R< LG (GLY i, |E,(t)|"" dt. (32) 
T/2 
With G given by (31), it is seen that (32) simplifies to 
; 5T/2 
R< pintk ya 2k(m+1) i |Ex(t) |" dt, 
T/2 


and this is (28). The proof in the general case is complete. The case k = 1 follows 
on using (25) and the above arguments. 

Besides results obtained from the case k = 1 (Corollaries 8.3 and 8.4), we may 
consider the case k = 2. It was proved by Motohashi and the author [30, 31] and 
Motohashi [40] that 


T T 
i. |Ex(t)| dt « T?/?, i |Ex(t)|? dt < T? log” T. 
0 0 
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Thus with k = 2 andm = 1, m = 2, we obtain 
- T 
/ I¢(4 + in|§ dt « T*” log’ T, / I¢(L + in| at < T? log**T, 
e 0 


which are (up to log-factors) the sharpest known results. 

The present knowledge about the function E;,(T) does not make it possible to 
exploit the moments of E,(T) when k > 3. However, there are some heuristic 
reasons to believe (see, e.g., [29]) that E;(T) <«, T?/4*® holds. Already this 
pointwise bound would lead to 


T 
ie Io(4 + in)? dt «, TATE, 
0 


hence to a substantial improvement over Heath-Brown’s classical bound (11). In 
fact, any bound of the form E3(T) <, T“t® with a < 1 would improve (11). 
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A Note on Bessel Twists of L-Functions 


J. Kaczorowski and A. Perelli 


Dedicated to Professor Helmut Maier for his 60th birthday 


Abstract We show that certain twists of Bessel type of a given L-function have 
meromorphic continuation over C. 


Keywords L-functions ¢ Selberg class * Twists * Bessel functions 
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1 Definitions and Statements 


The purpose of this note is to obtain the meromorphic continuation of certain twists 
of Bessel type of a given L-function. 

By an L-function we mean a member of the extended Selberg class S*, namely 
the class of Dirichlet series with continuation over C with at most a pole at s = 
1, and satisfying a general functional equation of Riemann type. Almost every L- 
function from number theory and automorphic representation theory belongs, at 
least conjecturally, to S*. See Selberg [17], Conrey-Ghosh [2], Ram Murty [11], 
and our survey papers [4, 6, 14, 15] and [16] for definitions and basic results on the 
Selberg class theory. In particular, we assume that the reader is familiar with the 
notions of degree and conductor of F € S*, respectively denoted by d and q. 
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By a twist of Bessel type, Bessel twist for short, we mean a twist of the following 
form. Let B(E) be a complex valued function of the real variable € > 1 with the 
property that as € > +00 


Be) ~ Da 0 < vy < Vay < Vag <> 00, (1) 
keZ 


where ~ means that for any positive integer K, B(&) equals the sum of the terms 
with |k| < K plus Ox(&~ ™™K+1-"-k-1)), Moreover, e(x) = e?”*, cy € C and the 
functions f;,(€, «) are of type 


Nx 
filE, 00) = D> oyé (2) 
j=0 


with a; = aj(k) € 0, Kj = K(k) andO < ky, < Kn,—-1 < +++ < Ko. Clearly, B(é) 
is bounded for € > 1. The twist of F ¢ S* by B(E) is defined for o > 1 by the 
absolutely convergent Dirichlet series 


CO 


Pos.B) = So NBO, 


ns 
n=1 
where a(n) are the coefficients of F'(s). The function F(s; B) is called a Bessel twist 
since the Bessel functions are typical examples of functions B(&); see below. In 
particular, as a very special case of the Theorem below we obtain that the twist 
3 a(n)J, (2ran* ) 


ns 


(3) 


n=1 


where J,,(z) denotes the familiar Bessel J-function and 0 4 a@ € R, has 
meromorphic continuation over C if k < 1/d, d being the degree of F(s). Moreover, 
it is entire whenever k < 1/d. 


Theorem. Let F € S' with d > 0 and B(£) be as above with all ky < 1/d. Then 
F(s; B) is meromorphic over C. If all fx(E, &) have a term with exponent < 1/d, then 
F(s; B) is entire. 


It is clear from the proof that we have some control on the poles of F'(s; B), as well 
as more precise information about its analytic character in certain cases. Moreover, 
we can also get meromorphic (or entire) continuation of F(s; B) in certain cases with 
ko > I/d. 

We finally remark that Noda [12, 13] obtained, by a different method, the 
meromorphic continuation of the twists of the Riemann zeta function by certain 
Bessel and hypergeometric functions, respectively. These results are now special 
cases of our Theorem, see the examples below; however, Noda obtained also several 
representations for such twists. 
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2 Proof 


The Theorem is a consequence of the properties of the nonlinear twists obtained in 
our papers [5, 7] and [9]; we first summarize the main features of such results. Given 
F € S' with d > 0 and f(E, «) as in (2) with ko < 1/d, the nonlinear twist 


CO 
a(n) 
Fis; f) = —f(n,a 
(sf) 2 5 eH (.@) 
is entire if f(€, «) has at least one exponent < 1/d. If instead f(€, @) is the standard 
twist a&'/4,0 4A a € R, then F(s;/f) is still entire if |a| ¢ Spec(F), while it has 
meromorphic continuation to C with at most simple poles at the points 


Sk = —— ——— ip k=0,1,... 


and nonvanishing residue at so if |a| € Spec(F’). The spectrum Spec(F) is defined as 
Spec(F) = {a > 0: a(nq) # 0} where ng = qd~4a4 and a(ny) = Oif ng ¢ N, 


while the real number 6; is the internal shift of F(s) (see, e.g., [9]). Moreover, when 
ko < 1/d the function F(s;f) is of order < 1 whenever entire. Further, F(s;f) has 
polynomial growth on vertical strips when f(&, «) coincides with the standard twist 
aé!/4, and satisfies the weaker bound F(s:f) « exp(|t|°) with some 6 < 1 in the 
other cases. 

The Theorem follows from the above results. Indeed, thanks to (1), for any given 
positive integer K we may write 


F(s;B) = )) ceF(s; fe) + Re (8) = Mc(s) + Re(9), 
|k|<K 


say. But Mx(s) is meromorphic over C (or entire) by the above results, and Rx(s) 
is absolutely convergent for 0 > — min(vg+1, v-x—1) + 1; the result follows since 
such a minimum becomes arbitrarily large as K increases. 

The above results about F'(s; f) show that in principle we may gain some control 
on the poles of F'(s; B). In addition, we have some control on the growth of F(s; B) 
on vertical strips. Moreover, under suitable information about the order of growth 
of the constants involved in (1), we can also deduce that F(s;B) is of order < 1 
when it is entire. We finally remark that in [8] and [10] we obtained the analytic 
properties of F(s;f) in certain cases with ko > 1/d, hence corresponding result can 
be obtained for F(s; B). 
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3 Examples 


As we already remarked, the Bessel functions are typical examples of functions 
B(&). Indeed, by (3) and (4) of Sect. 7.13 of the Bateman Project [3], the Bessel J- 
and Y-functions satisfy, respectively, 


K-1 
Jy (2) = (12/2)7"/fcos(z — va /2 — 1/4) y- or 
k=0 


K-1 
dk 
—sin(g— vx/2— 7/4) aaet} + O(lz x) 
k=0 


and 


K-1 
¥, (2) = (2/2)~\/{sin(g — vr/2 — 17/4) )* = 
k=0 


K-1 
+ cos(z— va /2— /4) )> sea} + Ol Po) 
Ri" 


as |z| > oo in —w < argz < 7, where c, and d; are certain explicit coefficients. 
Writing sin z and cos z in terms of e~” and choosing z = 27f(&, a), by the Theorem 
we obtain the continuation over C as meromorphic or entire functions of the Bessel 
twists 


ns 


> a(n)J, (2sf (n, «)) na a(n)Y, Gale w)) 


n=1 


provided F € S# has degree d > 0 and f(&,@) has Ko < 1/d. In particular, we may 
choose f(E, @) = w& witha € Rand ko < 1/d. 

We can apply the Theorem in many other similar situations, for instance with 
arbitrary Fox hypergeometric functions H(z) in place of the above J,(z) and Y,(z). 
In fact, Braaksma [1] shows that any such function admits an asymptotic expansion, 
for z = € — oo, as a finite sum of terms of the form 


aD aeéeC., 


Given F ¢ S', the twist F(s,H) converges somewhere if and only if Ra < 0 
in all such terms. If Sia < 0, the corresponding term of the twist converges 
everywhere and hence is entire. Otherwise, if ia = O our Theorem applies, 
securing meromorphic continuation whenever the exponent « does not exceed 1 /d. 
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We conclude with few details on the above remark about the order of F(s; B). In 
general, suppose that we have a function B(€) with expansion of type 


BS ye a, 


|ve|-<K - 
where 
lel >, ce KAM, RR (E) KAR EWK 


for certain positive constants 6, A and C and with a finite number of different 
exponents f;(€, «). Then a computation shows that the corresponding twist F(s; B) is 
of a finite order whenever entire. The assumption about the finiteness of the number 
of exponents can be relaxed, but instead we would need some uniform bounds for 
the involved coefficients a. This can be justified by means of the estimates in [9], 
see in particular Theorem 3 there. In particular, by the known bounds for the Bessel 
J-function we can prove that the twist in (3) is of order < | whenever entire. 
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The Sound of Fractal Strings and the Riemann 
Hypothesis 


Michel L. Lapidus 


Abstract We give an overview of the intimate connections between natural direct 
and inverse spectral problems for fractal strings, on the one hand, and the Riemann 
zeta function and the Riemann hypothesis, on the other hand (in joint works of 
the author with Carl Pomerance and Helmut Maier, respectively). We also briefly 
discuss closely related developments, including the theory of (fractal) complex 
dimensions (by the author and many of his collaborators, including especially 
Machiel van Frankenhuijsen), quantized number theory and the spectral operator 
Gointly with Hafedh Herichi), and some other works of the author (and several of 
his collaborators). 


Keywords Riemann zeta function * Riemann hypothesis (RH) * Quantization ¢ 
Quantized number theory ¢ Fractal strings * Geometry and spectra * Direct and 
inverse spectral problems for fractal strings * Minkowski dimension « Minkowski 
measurability * Complex dimensions * Weyl—Berry conjecture ¢ Fractal drums ¢ 
Infinitesimal shift * Spectral operator ¢ Invertibility * Quantized Dirichlet series 
and Euler product * Universality * Phase transitions * Symmetric and asymmetric 
criteria for RH 


1 Riemann Zeros and Spectra of Fractal Strings: 
An Informal Introduction 


Unlike an ordinary (Sturm—Liouville) vibrating string, which consists of a single 
interval (of length @, say), a fractal string consists of infinitely many intervals (of 
lengths €;, €2,--- ,€;,--+ , with £; | 0 as j — oo), vibrating independently of each 
other. Hence, the (eigenvalue or frequency) spectrum of a fractal string consists of 
the union of the spectra (counting multiplicities) of each of the countably many 
ordinary strings of which it is composed. 
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A fractal string (or, equivalently, its boundary, viewed as a compact subset of the 
real line R) always has (fractal) Minkowski dimension D between 0 and 1, the most 
extreme case D = 0 and D = 1 being referred to (following [74]) as the least and 
most fractal case, respectively, while the case when D = 1/2 is referred to (also as 
in [74]) as the midfractal case. The latter case will play a key role throughout this 
paper. 

By listening to a fractal string, one can detect whether or not one of its complex 
dimensions coincides with a nontrivial Riemann zero (that is, with a zero of the 
Riemann zeta function which is located in the critical strip 0 < Re(s) < 1). Indeed, 
it turns out that the Riemann zeta function ¢ = ¢(s) mediates between the geometry 
and the spectrum of a fractal string: 


f(s) = f(s) -Cv(s), () 


where €,,7(s) = €(s) := o,f; °° is the spectral zeta function of the fractal 
string 2, with {f,}?2, denoting the sequence of (suitably normalized) frequencies 
of “, written in nondecreasing order according to their multiplicities, and €y(s) := 
pea a is the geometric zeta function of &. This relation (discovered in [75, 76]) 
has played an important role in fractal string theory, as it enables one to understand 
why some of the complex dimensions of % (i.e., the poles of the meromorphic 
continuation of ¢) may be “canceled” by (nontrivial) zeros of ¢. Accordingly, 
certain oscillations which are present in the intrinsic geometry of the fractal string 
are no longer “visible” (or rather, “audible”’) in the spectrum of 2. 

More poetically, it is shown in [86, 87] that “One can hear the shape of a fractal 
string of dimension D 4 1/2” (in the sense of a certain inverse spectral problem, 
denoted by (ISP)p and to be specified in Sect. 6 below, and not in the original sense 
of Kac [65]) if and only if the Riemann hypothesis is true. Moreover, one cannot hear 
it for all of the fractal strings of dimension 1/2 (because ¢ = ¢(s) has zeros on the 
critical line Re(s) = 1/2). Hence, in the present approach, the truth of the Riemann 
hypothesis is equivalent to the existence of a (mathematical) phase transition at D = 
1/2 and at no other dimension D in the “critical interval” (0, 1). (See Theorems 6.15 
and 6.17 below.) By “hearing the shape of a fractal string” here, we mean that the 
inverse spectral problem (ISP)p in Sect. 6 below has an affirmative answer for any 
fractal string of dimension D. 

Phrased a little more precisely: one can hear the shape of a fractal string of a 
given (Minkowski) dimension D ¢€ (0, 1) if and only if the Riemann zeta function 
€(s) does not have any zero on the vertical line Re(s) = D. (See Theorem 6.14.) 
Furthermore, in general, one cannot hear the shape of a fractal string in the 
midfractal case where D = 1/2. (See Corollary 6.16.) Consequently, one can hear 
the shape of a fractal string in every possible dimension D € (0, 1) (other than 1/2) 
if and only if the Riemann hypothesis [150] is true; that is, if and only if 


f(s) =0, 0<Re(s) < 1 > Re(s) = - (2) 
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(See Theorem 6.15). These results have been established by Lapidus and Maier in 
[87] (announced in 1991 in [86]) building on the author’s earlier work [74] (see 
also [75, 76]) on a partial resolution of the Weyl—Berry conjecture in any dimension 
[6, 7]) as well as on the ensuing work of Lapidus and Pomerance [94] (announced 
in 1990 in [93]) on a resolution of the one-dimensional (modified) Weyl—Berry 
conjecture (of [74]) and its unexpected connections with the Riemann zeta function. 
(See Conjecture 5.9 and Theorem 5.11 in Sect. 5.) 

Later on, these results of [87] (which made use at the heuristic level of 
the intuition of complex dimensions) were reinterpreted (by Lapidus and van 
Frankenhuijsen in [99, 100]) in terms of the then rigorously defined notion of 
complex dimension, as well as extended to a large class of Dirichlet series and 
integrals, including all of the arithmetic zeta functions for which the generalized 
Riemann hypothesis is expected to hold. (See also [101, Chap. 9].) Moreover, a 
method similar to the one used in [87], but now relying in part on the explicit 
formulas established in [99-101] (and direct computations as well as on inverse 
spectral problems), was used to show that the Riemann zeta function, along with a 
large class of Dirichlet series and integrals (including most of the arithmetic zeta 
functions [131, 132, 157] and [101, Appendix A] or [79, Appendices B, C and E], 
other than the zeta functions of varieties over finite fields, for which the result clearly 
does not hold) cannot have an infinite vertical arithmetic progression of zeros. (See 
[101, Chap. 11] for this result and several extensions concerning the density of the 
Zeros.) 

Unknown at the time to Lapidus and van Frankenhuijsen of [99] (and of 
earlier papers on this and related subjects), this latter result about the zeros in 
arithmetic progression, in the special case of ¢, was already obtained by Putnam 
in [138, 139] by a completely different method, which could not be generalized to 
this significantly broader setting. This turned out to be quite beneficial to the general 
theory of complex dimensions as it led us in [99-101] to significantly improve and 
refine the authors’ original pointwise and distributional explicit formulas. (See, e.g., 
[101, Chap. 5].) 

In the rest of this paper, we will be more specific and explain what type of inverse 
spectral problem is involved here. (See, especially, Sect. 6.) First of all, we will 
need to precisely define (in Sect. 2) what is a fractal string as well as its Minkowski 
dimension and content. 

We will then recall (in Sect. 3) results from [93, 94] providing a characterization 
of the notions of Minkowski measurability and nondegeneracy, which will play 
a key role in Sect.5 and part of Sect.6 (as well as serve as a motivation for 
some aspects of Sect. 7.4). In Sect.4, we will discuss Weyl’s asymptotic formula 
and conjecture [180, 181] for the spectral asymptotics of drums with smooth (or 
sufficiently “regular’’) boundary, as well as the Weyl—Berry conjecture [6, 7] for 
drums with fractal boundary (or “fractal drums”) and its partial resolution obtained 
in [74]. In Sect.5, we will present the resolution of the modified Weyl—Berry 
(MWB) conjecture [74] for fractal strings obtained in [93, 94], thereby establishing 
a precise connection between the corresponding direct spectral problem and the 
Riemann zeta function € = ¢(s) in the critical interval 0 < s < 1. In Sect. 6, we will 
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introduce the aforementioned inverse spectral problem (ISP)p, for each D € (0, 1), 
and then precisely state the results of [86, 87] connecting it with the presence of 
zeros of the Riemann zeta function in the critical strip 0 < Re(s) < 1 (the so-called 
critical or nontrivial zeros), and thereby, with the Riemann hypothesis. 

Finally, in Sect. 7, we will discuss a variety of topics, closely connected to (or 
motivated in part by) the above developments. The subjects to be discussed include 
the mathematical theory of complex dimensions of fractal strings developed in [99- 
101] (see Sect. 7.2), its higher-dimensional counterpart recently developed in [108- 
113, 115, 116] (see Sect. 7.3), as well as aspects of “quantized number theory” (see 
Sect. 7.4) developed in [53] (and [49-52] along with [80]). In particular, in the latter 
subsection, we will see that the aforementioned inverse spectral problem (ISP)p can 
be rigorously reinterpreted in terms of the invertibility (or “quasi-invertibility”) of 
the “spectral operator” a = ¢(0), with the “infinitesimal shift” (of the real line) 0 
now playing the role of the usual complex variable s in the definition of the quantum 
(or operator-valued) analog of the classic Riemann zeta function ¢ = ¢(s). 

We close this introduction by providing several relevant references. For general 
references concerning the theory of the Riemann zeta function and related aspects 
of analytic number theory, we mention, for example, [29, 58, 59, 66, 79, 101, 131- 
133, 165, 174] along with the relevant references therein. For fractal string theory 
and the associated theory of complex dimensions, along with their applications to 
a variety of subjects, including fractal geometry, spectral geometry, number theory, 
and dynamical systems, we refer to [101], along with [30, 31, 44, 47, 49-53, 72- 
88, 90-97, 103-116, 119, 130, 134, 135, 140, 144, 172, 173, 183, 184] and the 
relevant references therein. In particular, Chap. 13 of [101] provides an exposition 
of a number of recent extensions and applications of the theory, including to fractal 
sprays (higher-dimensional analogs of fractal strings [95]) and self-similar systems 
({101, $13.1], based on [91, 92, 104, 105, 134, 135]), p-adic (or nonarchimedean) 
geometry ({101, $13.2], based on [83-85, 107, 114]), multifractals ({101, §13.3], 
based on [30, 96, 103]), random fractal strings ({101, $13.4], based on [44]), as well 
as fractal membranes and the Riemann (or modular) flow on the moduli space of 
fractal membranes ([{101, §13.5], based on the book [79] and on [88]). As a general 
tule, we will give specific references to the most recent monograph [101] rather than 
to the earlier research monographs [99, 100]. 


2 Fractal Strings and Minkowski Dimension 


A (nontrivial) fractal string is a bounded open set £2 C R which is not 
a finite union of intervals. Hence, §2 is an infinite countable disjoint union of 
(bounded) intervals J;, of lengths €; = [Jj], for j = 1,2,--- : 2 = UR Jj. Since 
|2Q| = pa €; < oo, we may assume without loss of generality that (¢;)%°, is 
nonincreasing and so ¢; | 0 as j — oo. Furthermore, for our purposes, we may 
identify a fractal string with its associated sequence of lengths (or scales), 2 := 
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(€)) 21, written as above (counting multiplicities). Indeed, all of the geometric 
notions we will work with, such as V(e), the Minkowski dimension and content, 
as well as the geometric zeta function and the complex dimensions, depend only 
on 2 = (€;)%, and not on the particular geometric realization of 2 as a bounded 
open subset (2 of R. 

Given ¢ > 0, the (inner) e-neighborhood (or inner tube) of 02 (or of &) is 


given by 
Q, = {x € 2: d(x, 022) < 8}, (3) 


where 02 denotes the boundary of {2 (a compact subset of R) and d(-, dS2) denotes 
the (Euclidean) distance to 092. 
In the sequel, we will let 


V(e) = Ve(e) := |S2el, (4) 


the volume (really, the length or one-dimensional Lebesgue measure) of (2,. As 
was mentioned above, it can be shown (cf. [93, 94]) that V(e) depends only on @ 
(and not on the particular geometric realization 2 of &). Then, given d > 0, the 
d-dimensional upper Minkowski content of @ (or of 092) is given by 


Mj = Mj (L):= lim eo (5) 
e>0t € 


l=d° 


and similarly for the d-dimensional lower Minkowski content “aq = Ba(ZL), 
except for the upper limit replaced by a lower limit. 

The Minkowski dimension of @ (or of its boundary 092),' denoted by D = Dy, 
is defined by 


D:= inf{d>0: 47 < oo} (6) 
= sup{d >0:.4@7 = +00}. 


We note that since || < oo, we always have 0 < D < 1. Furthermore, from 
the physical point of view, D will play the role of a critical exponent (or critical 
parameter): it is the unique real number D such that .4* = +00 ford < D and 
Mj = 0 ford > D. 

The upper (resp. lower) Minkowski content of @ is defined by 4* := Jj 
(resp., Ws. := Msp). We always have 0 < & < @* < wo. If 0 < 
M,(<) M* < oo, then # is said to be Minkowski nondegenerate. If, in addition, 
M,, = M* (i.e., if the upper limit in Eq. (5) is a true limit in (0, +00), with d := D), 


'This is really the upper Minkowski dimension of 0, relative to Q, but we will not stress this 
point in this paper (except perhaps in Sect. 7.3). 
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then we denote by .@ this common value, called the Minkowsi content of &, and 
the fractal string & (or its boundary 0£2) is said to be Minkowski measurable. So 
that @ = lim,_,9+ V(e)/e!? and 0 < a < oo. 

We close this section by stating a theorem that shows the intimate connections 
between the geometric zeta function ¢y of a fractal string & [introduced in Sect. 1 
above, just after Eq. (1)], and the Minkowski dimension of &. (Recall that Cy(s) = 
pe e , for all s € C with Re(s) sufficiently large.) This result was first observed 
by Lapidus in [75, 76], using earlier work of Besicovich and Taylor in [8], and has 
since then been given several direct proofs in [101, Theorem 1.10] and in [101, 
Theorem 13.111]; see also [114]. It also follows from general results in [110, 116]. 


Theorem 2.1 ([75, 76]). Let &@ be a fractal string. Then, the abscissa of conver- 
gence of ¢ coincides with the Minkowski dimension of @ : 0 = D. 


Recall that the abscissa of convergence of ¢ ¥ is given by 


o:=inf}@ eR: & < ooh. (7) 


j=l 


It follows from this definition and from known results about Dirichlet series with 
positive coefficients (see, e.g., [165, §VI.2.2 and §VI.2.3]) that Cy = Cy(s) is 
holomorphic for Re(s) > D and that the open right half-plane {Re(s) > D} is 
the largest right half-plane (of the form {Re(s) > aw}, for some a € R U {+00}) to 
which ¢ y can be holomorphically continued as well as the largest such half-plane in 
which the Dirichlet series }°°° , £; is absolutely convergent (and hence, convergent). 
Therefore, ¢v does not have any pole in {Re(s) > D}. (We refer, e.g., to [165, §V1.2 
and §V1.3] or to [45] for an introduction to the theory of Dirichlet series.) 

Furthermore, note that s = D is always a singularity of Cy (i.e., Cy(s) > +00 
as s > Dt, s € R) and therefore, if £4 can be meromorphically continued to an 
open (connected) neighborhood of D, then D is a pole of ¢v. Theorem 2.1 above 
(according to which D = o), along with this last observation, is one of the original 
justifications for calling the poles of a meromorphic continuation of ¢ (to an open 
connected neighborhood of {Re(s) > D}) the (visible) complex dimensions of the 
fractal string &. (See [99-101] and, for a brief introduction, see Sect. 7.2 below.) 

Here and thereafter, {Re(s) > a} stands for {s € C : Re(s) > a}. By convention, 
for @ = —oo or +00, respectively, it coincides with C or the empty set 9. Similarly, 
if a € R, {Re(s) = a} denotes the vertical line {s € C : Re(s) = a}. 


3 Characterization of Minkowski Measurability 
and Nondegeneracy 


We recall here some of the joint results of Lapidus and Pomerance obtained in [93, 
94]. Further results from that work will be discussed in Sect. 5. 
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Theorem 3.2 (Characterization of Minkowski Measurability [94]). Let 2 = 
(€)) be a fractal string of Minkowski dimension D € (0,1). Then, Z is Minkowski 
measurable if and only if 


£,~ Lj"! asj > 00, (8) 
for some constant L € (0,+00). In that case, the Minkowski content of &@ is 
given by 


gi-D 


M = i=D 


i (9) 


Equation (8) precisely means that the limit of £; - j'/® exists in (0, +00) and is 
equal to L. Furthermore, note that (8) is equivalent to 


Neg(x) ~ Mx? as x > +00, (10) 


with M := L” and where Nv, the geometric counting function of @, is defined for 
all x > 0 by 


Nex) = #( = 1:6" <x). (11) 


Here and thereafter, #B denotes the number of elements of a finite set B. Moreover, 
Eq. (10) precisely means that x? Nyv(x) > M asx — +00, for some M € 
(0, +00). 

Although we will not use this result explicitly, it is helpful to also give the 
counterpart of Theorem 3.2 for Minkowski nondegeneracy. Let a, (resp., a*) be 
the lower (resp., upper) limit of €; - ji/> as j — ox; so that we always have 
0<a, <a* < oo. 


Theorem 3.3 (Characterization of Minkowski Nondegeneracy [94]). Let & be 
a fractal string of dimension D € (0,1). Then, @ is Minkowski nondegenerate (i.e., 
0 < 4,(<)M* < oo) if and only if 


0 < ax (<)a* < oo. (12) 


Concretely, Eq. (12) means that there exist positive constants c, C > | such that 
cP < £; < cf '/? for all j = 1 or, equivalently, C7!x? < Ny(x) < Cx? for 
allx > 0. 

Let N, = N,v denote the spectral (i.e., frequency) counting function of &. 
Hence, N, (x) := #({f € o(Z@) : f < x}) for all x > 0, where o(-2) = {n- £5" : 
n> 1,j = 1} denotes the (frequency) spectrum of @. (In essence, the frequencies of 
£ are, up to a multiplicative normalizing positive constant, equal to the square roots 
of the eigenvalues of ”; that is, of the eigenvalues of the Dirichlet Laplacian —A = 
—d*/dy* on any geometric representation 2 of Y by a bounded open subset of R.) 
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It is also shown in [94] that Eq. (12) (and hence also, the Minkowski nondegeneracy 
of &, according to Theorem 3.3) is equivalent to 


0 < ,(<)5* < 00, (13) 


where 6, (resp., 5*) denotes the lower (resp., upper) limit of @, (x)/x? as x > +00, 
and g(x) is the “asymptotic second term” for N,(x); namely, g,(x) := W(x) — 
N,(x), with W(x) := |Q2|x being the Weyl (or leading) term to be discussed in 
Sects. 4.1, 5, and 6 below. (In general, one always has g,(x) > 0 for all x > 0 and 
thus 0 < 6, < 6* < 00.) 

Equation (13) means that there exist c;} > 1 such that cp'x? < g(x) < 
c,x? for all x > 0; in other words, the error estimates of [74], to be discussed 
further on in Theorem 4.8 of Sect. 4.1, are sharp in this case. More specifically, 
when N = 1 and D € (0,1), each of the equivalent conditions (12) and (13) 
characterizes the sharpness of the remainder estimates of [74], recalled in Eq. (19) 
(see Theorem 4.8(i) below). 

It is natural to wonder whether the counterpart of Theorem 3.3 and of its 
complement for N, holds for one-sided (rather than two-sided estimates). The 
answer may be somewhat surprising to the reader. In short, it is positive for the 
upper estimates but negative for the lower estimates. More specifically, still in [94], 
it is shown that 


M* <0 & t= O(7"”) asj > 00 (ie., a* < 00) & Nv(x) = O(x”) 
(14) 


asx > +00 & 9, (x) = O(x”) as x > +00, (ie., 6* < 00), 


and, more generally, that the exact counterpart of (14) holds for any d > 0, provided 
D and .@* = dj; are replaced by d and .4@/, respectively. This provides, in 
particular, a converse (in the present one-dimensional case) to the error estimate 
obtained in [74] for the asymptotic second term of N,,(x) to be discussed next; see 
part (7) of Theorem 4.8 below, specialized to the case where N = 1 and D € (0, 1). 

Moreover, it is shown in [94] by means of an explicit counterexample that in the 
analog of (14) for .4,, the implications in one direction holds, but not in the other 
direction. For example, it is not true, in general, that.Z% >0 <3 as >0<@ 6, > 0. 

Here, in Eq.(14), as well as in the sequel, given f : [0,+00) — R and 
g : [0,+00) — [0,+00), one writes that f(x) = O(g(x)) to mean that there 
exists a positive constant co such that |f(x)| < c2g(x), for all x sufficiently large. 
(For the type of functions or sequences we will work with, we may assume that 
this inequality holds for all x > 0.) We use the same classic Landau notation for 
sequences instead of for functions of a continuous variable. 

In closing this section, we note that under mild assumptions on & (about the 
growth of a suitable meromorphic continuation of its geometric zeta function Cy), it 
has since then been shown that within the theory of complex dimensions developed 
in [99-101], the characterization of Minkowski measurability obtained in [94] 
(Theorem 3.2 above) can be supplemented as follows (see [101], Theorem 8.15]), 
under appropriate hypotheses. 
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Theorem 3.4 (Characterization of Minkowski Measurability Revisited 
[94, 101]). Let & be a fractal string of Minkowski dimension D. Then, under 
suitable conditions on Cy (specified in [101, Sect. 8.3]), the following statements 
are equivalent: 


(i) & is Minkowski measurable. 
(ii) | Condition (8), or equivalently (10), holds. 
(iii) | Dis the only complex dimension of & with real part D, and it is simple. 


Moreover, if any of these equivalent conditions is satisfied, then the Minkowski 
content of @ is given by 


M _91-p res(€¥,D) 


. as 
—D D(1—D)’ 


(15) 


— 


where res(€v, D) denotes the residue of &y(s) ats = D, M := LP and Land M 
are given as in (8) and (10), respectively. 


Recall that the complex dimensions of are the poles of a (necessarily unique) 
meromorphic continuation of ¢y (to a connected open neighborhood of {Re(s) > 
D}). According to a result of [75, 76] (see Theorem 2.1 above), there are no complex 
dimensions with real parts > D; see the discussion following Eq. (7) above. 


Remark 3.5. When £ is a self-similar string (in the sense of [101, Chap. 2] ie., 
the boundary of Y is a self-similar set), it is shown in [101, Theorems 2.16, 8.23 
and 8.36] that no hypothesis on ¢y is needed in the counterpart of Theorem 3.4, 
and that either of the equivalent statements (i), (ii), or (411) of Theorem 3.4 is true 
if and only if @ is nonlattice; i.e., iff the subgroup of R generated by the logarithms 
of its distinct scaling ratios is not of the form 9Z, for some @ > 0. If & is lattice, 
then it is Minkowski nondegenerate but is not Minkowski measurable. 


Finally, we mention that the original proof of Theorem 3.2 given in [93, 94] was 
analytical and combinatorial in nature. Several parts of the proof have since been 
established in a different manner by Falconer in [31], using some techniques from 
the theory of dynamical systems, and more recently (and perhaps most concisely), 
by Rataj and Winter in [144] using techniques from geometric measure theory (in 
particular, from [143, 169] and the relevant references therein). 


Remark 3.6. We also note that the notion of Minkowski dimension was introduced 
(for noninteger values of the dimension) by Bouligand in the late 1920s in [16]. The 
notion of (normalized) Minkowski content was introduced by Feder in [33], while 
that of Minkowski measurability was apparently first used by Stacho in [169]. 

The Minkowski dimension is also often called “box dimension” (see, e.g., 
(32, 97, 101, 121, 122, 175] or the applied literature on fractal dimensions), entropy 
dimension, or capacity dimension, for example. Contrary to the Hausdorff dimen- 
sion, it is not o-stable [that is, it is usually not true that D(U?2 Ax) = sup, ; D(Ax)], 
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where D(A) denotes the upper Minkowski dimension of A; see, e.g., [32, 122, 175]. 
Furthermore, unlike the Hausdorff measure (which is a true positive Borel measure, 
in the usual mathematical sense of the term [21, 32, 35, 97, 122]), the Minkowski 
content (when it exists) or more generally, the upper Minkowski content, is not 
a measure (it is only finitely sub-additive). In fact, as is pointed out in [74] (see 
also [75, 76]), and somewhat paradoxically, this is precisely because it does not 
have all of these desirable mathematical properties that the Minkowski dimension 
is important in the study of aspects of harmonic analysis as well as of spectral and 
fractal geometry, including the study of the vibrations and spectra of fractal drums or 
“drums with fractal boundary” ([74-76], see also Sect. 4 below) and, in particular, 
of fractal strings (i.e., one-dimensional drums with fractal boundary). See, e.g., [74, 
Examples 5.1 and 5.1’]. 


4 The Weyl-Berry Conjecture for Fractal Drums 


In this section, we first briefly recall Weyl’s classic formula for the leading 
spectral asymptotics of ordinary (smooth or piecewise smooth) drums, as well 
as corresponding extensions and sharp remainder estimates (from [74]), valid 
for general fractal drums and providing a partial resolution of the Weyl—Berry 
conjecture [6, 7] in any dimension; see Sect. 4.1. (In Sect. 5, we will specialize the 
situation to the one-dimensional case, which is the main focus of this paper.) In the 
latter part of this section, we will comment on various aspects of Weyl’s formula 
for fractal drums, both in the case of drums with fractal boundary (which is of most 
interest here) and in the related case of drums with fractal membrane (corresponding 
to Laplacians and Dirac operators on fractals themselves); see Sect. 4.2. We will also 
conclude this section by briefly mentioning some of the physical and technological 
applications of these mathematical results about fractal drums and of the original 
(or modified) Weyl—Berry conjecture. 


4.1 Weyl’s Asymptotic Formula with Sharp Error Term 
for Fractal Drums 


Hermann Weyl’s classic asymptotic formula ({180, 181]; see also, e.g., [23, 146]) 
for the frequency (or spectral) counting function N, = N,(x) of an ordinary 
(N-dimensional) drum can be stated as follows: 


Nv (x) = Cy|2|wa™ + of) (16) 


°A simple counterexample is provided by A := {1/k : k > 1} and Ay := {}} for each k> 1, 
viewed as subsets of R; then, D(A) = 1/2, whereas D(A,) = 0 for all kK > 1 and hence, 
supy> 1 D(Ax) = 0. 
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as x — +00. Here, || := |2|y denotes the volume of the bounded open set 
2 Cc RY (ie., the N-dimensional Lebesgue measure of 2). Furthermore, given 
x > 0,N,(x) is the number of (suitably normalized) frequencies f of the drum not 
exceeding x, and Gy is an explicitly known positive constant which can be expressed 
in terms of the volume of the unit ball of R” [and therefore, in terms of appropriate 
values of the gamma function J” = I"(s)]. In the sequel, we denote the leading term 
in (16) by 


W(x) := G|Q|vx" (17) 


and call it the Weyl term. In addition, mathematically, the spectrum of the (N- 
dimensional) “drum” is interpreted as the spectrum of the Dirichlet Laplacian —A 
on a given nonempty bounded open set 2 C R(N > 1), with boundary 0. 
Furthermore, the Dirichlet boundary conditions are interpreted variationally (or in 
the distributional sense); see, e.g., [17, 120] or [74, §2]. 

Since the Laplacian is a second order (self-adjoint, positive) linear operator, 
the aforementioned (normalized) frequencies are (up to a multiplicative constant) 
equal to the square roots of the eigenvalues. In the present situation, the spectrum 
is discrete and hence, we can order these frequencies in nondecreasing order (and 
according to their multiplicities) as follows: 


0<fi<fp<--+<fi <:::, withf, > +00 asn > oo. (18) 


Remark 4.7. Physically, W(x) can be interpreted as a volume in phase space. More 
specifically, W(x) is proportional to 


{@, 6) € 2x RY EP <x }lw = [BO Dw [Aly 2%, 


where R” x RY ~ R is the “phase space” (the space of positions and velocities 
or equivalently, momenta, of the classical particle) and for 9 > 0, B(O, @) denotes 
the ball of center the origin and radius @ in RY. 


In [6, 7], extending to the fractal case a classic conjecture of the mathematician 
Weyl [180, 181] in the “regular” (or “smooth” case),* the physicist Michael Berry 
has conjectured that (16) should be completed to obtain an asymptotic second term 
for the spectral counting function N,(x), of the form —Cy,47H4(d2)x" =: S(x), 
where H is the Hausdorff dimension of the boundary 02 (H € [V—1,N]), H(0L) 
is the H-dimensional Hausdorff measure of 092 (a well-known fractal generalization 
to noninteger dimensions of the notion of “volume”; see, e.g., [32, 33,97, 122, 175]), 
and Cyn is a positive constant independent of $2 and depending only on N and H 


3See, e.g., [55-57, 60-62, 71, 124, 125, 162-164] (along with the relevant references therein and 
in [101, §12.5 and Appendix B]) for results (in the smooth case) concerning the Weyl conjecture 
about the asymptotic second term of the spectral counting function N, (x). 
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(as well as expressed in terms of the gamma function, by analogy with the known 
results in integer dimensions for simple examples such as N-dimensional cubes). 

Unfortunately, it turns out that Berry’s conjecture (called the Weyl—Berry 
conjecture in [74] and in the literature since then), although very stimulating, is 
not correct, as was first noted by Brossard and Carmona by means of an explicit 
counterexample in [18] and then, explained from a mathematical point of view 
(and illustrated by a family of even simpler counterexamples) in [74]. (See, in 
particular, [74, Examples 5.1 and 5.1’].) Furthermore, H should be replaced by 
D, the (inner) Minkowski dimension of 022, and #;(082) might reasonably be 
replaced by -@p(02), the (inner) Minkowski content of 02. Finally, as was shown 
in [75, 76, 94, 95], even the expected constant Cy,4 (when it exists) does not simply 
take the form of Cy.p (where D is the Minkowski dimension of 02) but whatever 
replaces the factor of proportionality in the counterpart of S(x) should merely be 
expressed in terms of the residue at s = D of the meromorphic continuation 
of the corresponding spectral zeta function ¢,(s) = °°, f°, where (f,)%, is 
the sequence of frequencies of the drum, as given in Eq. (18) above. In fact, in 
Sect. 5, we will see that when N = 1 (the special case of fractal strings instead of 
higher-dimensional fractal drums) and when 02 (or, equivalently, the fractal string) 
is Minkowski measurable (with Minkowski content denoted by .@), then C,.p is 
proportional to the positive number —¢(D), where ¢ = ¢(s) is the classic Riemann 
zeta function and D € (0, 1). (See Theorem 5.11 below, where in light of (23), Ci.p 
is of the form cp.@, with cp given by Eq. (24).) 

On the positive side, the following partial resolution of the Weyl—Berry conjec- 
ture was obtained by Lapidus in [74] (recall that the Weyl term W(x) is given by (17) 
above and is therefore proportional to x”): 


Theorem 4.8 (Sharp Error Estimates [74]). Let 82 be any (nonempty) bounded 
open subset of IR“. Recall that we always have D € [N—1,N], where D = D(02) is 
the (inner) Minkowski dimension of 02; see comment (b) in Sect. 4.2 just below. 


(i) Then, in the “fractal case” where D € (N — 1,N], we have 
N, (x) = W(x) + O(x”) as x > +00, (19) 


provided M;(02) < 00, where the (pointwise) remainder estimate O(x?) is 
sharp for every D € (N —1,N). Furthermore, if 5 (092) = +00, then D 
should be replaced by D + ¢, for any arbitrarily small ¢ > 0. 

(ii) In the “nonfractal case” where D = N — 1 (which is the case, for example, 
if the boundary 082 is piecewise smooth or, more generally, locally Lipschitz), 
then exactly the same result as in part (i) holds, except for the fact that the error 
term now takes the form O(x? log x) as x > +00 (with D := N — 1). 


When .4 (082) = +00, either in case (i) or (ii) of Theorem 4.8, one should try 
to use the later extension of this theorem obtained in [47] and expressed in terms 
of generalized (upper) Minkowski contents, relative to suitable gauge functions; see 
comment (g) in Sect. 4.2 just below. 
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Note that in the most fractal case when D = N, the remainder estimate (19) still 
holds but is clearly uninteresting because then, the error term is of the same order 
as the leading term W(x) given by Eq. (17). 


4.2 Further Comments, Extensions, and Applications 


We next comment on various aspects and extensions (in related contexts) of 
Theorem 4.8 and of the Weyl—Berry conjecture. Just as in Sect. 4.1, our discussion 
is not meant to be exhaustive in any way but simply aims at providing various 
pointers and references where the interested reader can find a lot more detailed 
information. 


(a) The nonfractal case where D := N — | (see part (11) of Theorem 4.8 just above) 
was essentially already obtained by Métivier in [128] (see also [126, 127]), in 
a slightly less precise form and using a different terminology (not explicitly 
involving the notions of Minkowski dimension and content). 

(b) In Theorem 4.8, the (inner) Minkowski dimension D and (inner) upper 
Minkowski content .45 := .@;(082) are defined exactly as in the one- 
dimensional case where N = | (see Eqs. (6) and (5), respectively), except that 
on the right-hand side of (5) (with d := D), V(e)/e!~? should be replaced 
by V(e)/e’”. Since 2 C RY is bounded, it has finite volume and therefore 
it is easy to check that D < N. Furthermore, since 0{2 is the boundary of a 
(nonempty) bounded open set, its topological dimension is equal to N — 1 and 
hence, D > N — 1. Consequently, as was observed in [74], we always have 
N-1<D<N. (For the special case of fractal strings, we have N = 1 and 
we thus recover the fact that 0 < D < 1; see the statement following Eq. (6) in 
Sect. 2.) 


In [74], the case where D = N — 1 is called the least or nonfractal case, the case 
where D = N is called the most fractal case, while the case where D = N — ; (.e., 
the codimension N — D = {D} is equal to 1/2, where {D} € [0, 1) is the fractional 
part of D), is referred to as the midfractal case. It turns out that each of these cases 
(where the codimension N — D takes the value 0, 1 or 1/2, respectively)* plays an 
important role in the proof of Theorem 4.8 (and its generalizations) given in [74] 
(as well as in related spectral or geometric results obtained in [74, Examples 5.1 
and 5.1’, along with Appendix C)). 


(c) To show that the remainder estimates are sharp (in case (i) of Theorem 4.8 
where D € (N — 1,N) and .4@5 (082) < oo), a simple one-family of examples 
is constructed in [74, Example 5.1 (V = 1) and Example 5.1’ (N > 1)]. When 
N = 1, itis of the form 2, := U2) (G+ Il)“, 7“), where a > 0 is arbitrary, so 


‘Strictly speaking, when D = N — 1,N — D = 1 is not equal to {1}. 
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that 02, = {O}U{j~* : j = 1}. It follows that for every a > 0, H = 0 (since 022, 
is countable), whereas D = (a+1)~!, 092, is Minkowski measurable (as defined 
in Sect. 2 above and shown in [74]) with Minkowski content 4 := .@p(092) 
equal to 2!~?a?/(1 — D); see [74, Appendix C]. This was the first explicit 
example of a fractal string and served as a motivation for the formulation of 
Theorem 3.2 above (the characterization of Minkowski measurability for fractal 
strings, obtained in [94]) and the MWB conjecture, stated in [74] and to be 
discussed in Sect.5 below (see Conjecture 5.9). Note that the midfractal case 
where D = 1/2 corresponds to a = 1, while the most (D = 1) and least (D = 0) 
fractal cases correspond, respectively, to the limits a > 0* and a > +00. 
Furthermore, the symmetry a <> 1/a exchanges D and 1 — D. 


Finally, when N > 2, exactly the same statements as above are true for the N- 
dimensional analog (a fractal comb) of 2,, defined by Qy.y := Qq x [0, 1] —! We 
then have that H = N—1,D = N—1+ (a+ 1)7!, 02 ,y is Minkowski mea- 
surable and .@p(0Q,.v) has the same value as above. Furthermore, the remainder 
estimate (19) of part (7) of Theorem 4.8 is still sharp for every a > 0 [and hence, for 
every D € (N —1,N)]; see [74, Example 5.1’]. Actually, it follows from the later 
results (from [94]) recalled in Sect. 5 below that for every a > 0, 


N, (x) = W(x) — cn.p -Hp(92)x? + o(x”) as x > +00, (20) 


where {2 = Q,,, W(x) is given by (17), and cy.p is explicitly known. 


(d) Under suitable hypotheses, Theorem 4.8 has an exact counterpart for the 
Neumann Laplacian (instead of the Dirichlet Laplacian), with the boundary 
conditions interpreted variationally (as in [17, 120] or [74, §2]) as well as 
for higher order, positive self-adjoint elliptic operators with (possibly) variable 
coefficients and with Dirichlet, Neumann, or mixed boundary conditions; see 
[74], Theorem 2.1 and its corollaries. For the Neumann Laplacian, the error 
estimate (19) holds, for instance, for the classic Koch snowflake domain 
and, more generally, for all planar quasidics [123, 136] (e.g., for the simply 
connected planar domains bounded by the Julia sets of the quadratic maps 
zt» 2 +, where the complex parameter c is sufficiently small). 

In [18], for the Dirichlet Laplacian and using probabilistic methods, Brossard 
and Carmona have obtained error estimates (as t — O*) for the partition 
function Z, = Z,(t) := Tr(e'4) (the trace of the heat semigroup), of interest 
in quantum statistical mechanics, probability theory, harmonic analysis, and 
spectral theory. These estimates are also expressed in terms of the (inner) 
Minkowski dimension D. We should note, however, that even though part (i) of 
Theorem 4.8 implies those error estimates, the converse is not true, in general. 
Indeed, typically, beyond the leading term (for which a classic Tauberian 
theorem can be used in order to show the equivalence of the corresponding 
results for N,,(x) and Z,(f) (see, e.g., [65, 168]), pointwise asymptotic results 
for N, (x) are considerably more difficult to obtain than for Z, (t). 


(e 


wm 


Fractal Strings and the Riemann Hypothesis 215 


(f) The Weyl—Berry conjecture for drums with fractal boundary (and its modifica- 
tions in [74]) has since been studied in a number of different contexts, both 
analytically and probabilistically. We refer to [101, §12.5] for a number of 
references on the subject, including [18, 34, 38, 39, 44, 47, 74-76, 86, 87, 93- 
95, 99-101, 109, 116, 129, 176]. We also refer [40, 55-57, 60-62, 71, 74-76, 
124, 125, 162-164] and [101, Appendix B] for related references (including 
[40] in the case of partition functions) on the spectral asymptotics of smooth 
(as opposed to fractal) drums. A general introduction to Weyl’s asymptotic 
formula and its analytic or probabilistic proof (for smooth or piecewise smooth 
boundaries, for example) can be found in [23, 40, 65, 146, 168]. 


Finally, we mention that the original Weyl—Berry conjecture was also formulated 
for “drums with fractal membrane” (as opposed to “drum with fractal boundary’’). 
An appropriate modification of the conjecture was established in that setting by 
Kigami and Lapidus in [69] (building on a conjecture of [76]) for a large class of 
self-similar fractal drums corresponding to Laplacians on fractals (the so-called 
“finitely ramified” or p.c.f. self-similar sets), such as the Sierpinski gasket, the 
pentagasket, and certain fractal trees. In particular, in [69] is established a suitable 
analog of Weyl’s classic asymptotic formula, but now for the leading spectral 
asymptotics of Laplacians on (self-similar) fractals (see, e.g., [68]) rather than on 
bounded open sets with fractal boundary. The resulting semi-classical formula, 
along with its geometric interpretation, is further explored and extended from 
the point of view of nonsmooth geometric analysis and Connes’ noncommutative 
geometry [22], in [70, 77, 78] and (in a somewhat different setting and using suitably 
constructed Dirac operators and intrinsic geodesic metrics on the fractals under 
consideration), in [20, 98]. 

Beside [69, 76], see also, e.g., [101, §12.5.2] for many other references on (or 
related to) this subject, including [20, 26, 36, 42, 43, 70, 72, 73, 77, 78, 98, 141, 142, 
153-155, 166, 172, 173]. 


(g) Theorem 4.8 above (from [74]), along with the results from [94] stated in Sect. 3 
above and Sect. 5 below, and a part of the results from [87] discussed in Sect. 6 
below have been extended by He and Lapidus in the research memoir [47] to 
the more general situation where the (now generalized) Minkowski content 
is no longer defined in terms of a strict power law, but in terms of a large 
class of “gauge functions” involving, for example, expressions of the form 
x? log x, x? log log x,--- or x”/ log log log x, etc. This is of interest in the appli- 
cations to fractal geometry, spectral geometry, harmonic analysis, probability 
theory, stochastic processes, and mathematical or theoretical physics. 


We close this section by briefly commenting on the physical relevance of Weyl’s 
asymptotic formula (see [65] for an interesting overview), Theorem 4.8 above (from 
[74]) and the Weyl—Berry conjecture which partially motivated it. The Weyl—Berry 
conjecture, along with its various modifications and extensions, has potential or 
actual physical and engineering applications to condensed matter physics, quantum 
mechanics, quantum chemistry, quantum chaos, acoustics, diffusions, and wave 
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propagation in fractal or random media (i.e., on or off fractals), geophysics, radar 
and cell phone technology (fractal antennas), as well as the making of computer 
microchips. (See, e.g., [6, 7, 20, 65, 70, 72-79, 89, 98, 102, 141, 142, 156, 159] and 
the relevant references therein.) 

We will next focus on the one-dimensional case (i.e., NV = 1), as in the rest of this 
paper (with the exception of the present section), and therefore on the connections 
between this subject and aspects of number theory, particularly the Riemann zeta 
function and the Riemann hypothesis, in Sects. 5 and 6, respectively. 


5 Direct Spectral Problems for Fractal Strings 
and the Riemann Zeta Function in the Critical Interval 


Having discussed in Sect. 4 the Weyl classical asymptotic formula (with error term) 
and the closely related Weyl—Berry conjecture, we may now specialize the situation 
to the one-dimensional case (that of fractal strings, corresponding to N = | in 
Sect.4.1) and discuss the key result of [94] establishing the (one-dimensional) 
“MWB conjecture” for fractal strings; see Theorem 5.11 below. This brings to the 
fore direct connections between the spectra of fractal strings and the Riemann zeta 
function ¢ = C(s), in the case of the “critical interval” 0 < s < 1. (See also Eq. (1) 
above for a related general formula.) This connection will be further explored in 
Sect. 6 (based on [87]), in relation with the critical strip 0 < Re(s) < 1, the Riemann 
hypothesis and the converse of Theorem 5.1 1; that is, an inverse (rather than a direct) 
spectral problem for fractal strings; see, in particular, Theorems 6.14 and 6.15 (from 
[87]) in Sect. 6 below. 

Let 92 be an arbitrary (nontrivial) fractal string (i.e., a bounded open subset of 
R), of Minkowski dimension D € (0, 1) and associated sequence of lengths 2 = 
(€)) 21, written in nonincreasing order (according to multiplicities) and such that 
£; | Oas j — ov; see Sect. 2. In the sequel, we will refer to such a fractal string as 
2 or L, interchangeably. 

Recall from our discussion in Sects. | and 4.1 that a one-dimensional fractal drum 
(i.e., drum with fractal boundary) is nothing but a fractal string. Furthermore, recall 
from Sect. 2 that the (Minkowski) dimension of a fractal string always satisfies 0 < 
D < 1. In the sequel, we exclude the extreme cases where D = 0 and D = 1 (the 
least or nonfractal case and the most fractal case, respectively) and therefore assume 
that D belongs to the critical interval (0,1) :0< D< 1. 

Letting N = | in the expression (17) for the Weyl term [i.e., the leading term 
in Weyl’s asymptotic formula (16)] W = W(x), we obtain (given our current 
normalization for the frequencies) 


W(x) = |Q|, x, forx > 0. (21) 
So that, according to Weyl’s classic formula (16), we have 


N,(x) = W(x) + RQ), (22) 
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where W(x) is given by (21) and R(x) = o(x) as x > +00. Furthermore, provided 
M* < o (ie., Z has finite upper Minkowski content), then according to part (i) 
of Theorem 4.8 above (from [74]), the error term R(x) in (22) can be estimated as 
follows: R(x) = O(x?) as x > +00. Moreover, if & is Minkowski nondegenerate 
(ie. 0 < &, < M* < 00), we even have that R(x) is truly of the order of x? 
as x — +00 (according to a result of [94]) briefly discussed in Sect. 2 above; and 
conversely (still by Lapidus and Pomerance [94]), if R(x) is exactly of the order of 
x? [resp., if R(x) = O(x?)], then Y is Minkowski nondegenerate (resp.,.W@* < 00). 
(See Theorem 3.3 and Eq. (14).) 

The question is now to know when we really have an asymptotic second term, 
proportional to x”, in Weyl’s asymptotic formula (22). We will sometimes refer to 
such a term as a “ monotonic second term.” In [74], the following conjecture (called 
the MWB conjecture, for short) was stated. 


Conjecture 5.9 (MWB Conjecture [74]). If Z@ is a Minkowski measurable fractal 
string of dimension D € (0, 1), then 


N, (x) = W(x) — cp Gx” + 0(x”), as x > +00, (23) 


where .@ denotes the Minkowski content of 7 (i.e., of its boundary 02) and cp is 
a positive constant depending only on D. 


Remark 5.10. 


(a) In higher dimensions (i.e, when N > 2 and 2 C R% is a bounded open 
set), the counterpart of the MWB conjecture is not true, in general; see [34] 
and, especially, [95] for various counterexamples. Moreover, one does not know 
whether it is true for a simply connected domain in the plane (i.e., when N = 2). 
It is not the object of the present paper to discuss this issue further, although it 
is of significant interest and is also very intricate. 

(b) When N = 1, the Minkowski measurability condition is necessary, in general, 
for the spectral counting function N,, to admit a monotonic asymptotic second 
term, proportional to x?, as x — +00. For example, for the Cantor string 
L = Ls (defined by Qcs := [0, 1] \C, the complement in [0, 1] of the classic 
ternary Cantor set C), N, admits an oscillatory (asymptotic) second term. 
More specifically, for the Cantor string, the asymptotic second term is of the 
form —x? G(log, x), where G is a 1-periodic function which is bounded away 
from zero and infinity; see [94] and, especially, [101, Eq. (16.57) and §10.2.1]. 
This fact was first proved in [93, 94] (by a direct computation) and does not 
contradict the MWB conjecture since (as was also first proved in [93, 94]), the 
Cantor string is not Minkowski measurable (but is Minkowski nondegenerate). 
Moreover, these issues were investigated in great detail in [99-101], by using 
the theory of complex dimensions and associated explicit formulas developed 
in those monographs. (See, e.g., [101], Chap. 6, §8.4 and Chap. 10.) 


It is shown, for example, in [101, Chap. 6, §8.4] (building in part on con- 
jectures and results in [76]) that a self-similar string is Minkowski measurable 
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if and only it is nonlattice (i.e., the logarithms of its distinct scaling ratios are 
rationally independent). Moreover, it follows from Joc. cit. that a lattice self- 
similar string (e.g., the Cantor string) is never Minkowski measurable (but is 
Minkowski nondegenerate) and that its spectral counting function N, (x) always has 
an asymptotic second term which is oscillatory and of the order of x? (in fact, it is 
of the form x? G(log x), where G is a nonconstant periodic function on R which is 
bounded away from zero and infinity). 

More generally, we will see in Theorem 6.14 of Sect.6 (based on [87]) that 
if € = f(s) does not have any zeros on the vertical line {Re(s) = D}, then the 
existence of an asymptotic second term proportional to x? for the spectral counting 
function N,(x) implies that & is Minkowski measurable (i.e., the hypothesis of 
Conjecture 5.9 is also necessary). 


The following theorem (due to Lapidus and Pomerance in [94], and announced 
in [93]) gives the precise asymptotic second term of the spectral (or frequency) 
counting function of a Minkowski measurable fractal string, thereby resolving in the 
affirmative the MWB conjecture for fractal strings and yielding the specific value of 
the positive constant cp appearing in Eq. (23) above. 


Theorem 5.11 (Resolution of the MWB Conjecture for Fractal Strings [94)]). 
The MWB conjecture for fractal strings (Conjecture 5.9 above) is true for every 
D € (0,1). More specifically, if @ is a Minkowski measurable fractal string of 
dimension D € (0,1), then its frequency counting function N, = N,(x) admits a 
monotonic second term, proportional to x”, of the exact same form as in Eq. (23). 
Furthermore, the positive constant occurring in (23) is given by 


cp = (L—D) 2-"-)(—£(D)). (24) 


Note that cp > 0 because ¢ = ¢(s) is strictly negative in the critical interval 
0 <5 < 1 (see, e.g., [174]): —€(D) > 0 since D € (0, 1). Furthermore, in Eq. (24), 
the constant cp is proportional to the positive number —¢(D), and hence, to the value 
at s = Dof ¢ = ¢(s) in the critical interval 0 < 5 < 1. 

We next briefly comment on the structure of the proof of Theorem 5.11 given in 
[94]. It relies on two different theorems. Namely, the geometric characterization of 
Minkowski measurability (Theorem 3.2 above, from [94]) along with the following 
theorem (also from [94]), which is of an analytic number theoretic nature and whose 
motivation will be explained after its statement. (In the sequel, given y € R, [}] 
stands for the integer part of y and {y} := y—[y] € [0, 1) denotes the fractional part 
of y.) 


Theorem 5.12 (The Sound of Minkowski Measurable Fractal Strings [94)]). 
Let & be a Minkowski measurable fractal string of dimension D € (0,1). 
Equivalently, according to Theorem 3.2 (see Eq. (8) in Sect. 3), let 2 = (€))%, be 
a nonincreasing sequence of positive numbers satisfying (for some positive constant 
L> 0) 


6 ~ Lj? asj > oo. (25) 
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Then 


Yoel = | D0 & | xt S(D)LPX? + (27) as x > +00, (26) 
j=l 


j=l 


where € denotes the classic Riemann zeta function. Equivalently, Eq. (26) can be 
stated as follows: 


Ste) x} ~ —€(D) LPx? as. x > +00. 


j=l 


We note that in order to state Theorem 5.12, one does not need to assume that 
£ is Minkowski measurable and therefore, to rely on Theorem 3.2. Instead, one 
can simply assume that (25) [or, equivalently, (10)] holds for some D ¢€ (0, 1). 
Theorem 3.2 is then used when deducing Theorem 5.11 from Theorem 5.12. 

Given a fractal string 2 = Ue di (as in Sect. 2), its spectral counting function 
N, =: N,(Q,-) satisfies N,(2,-) = 2, N.G,-), with N,Uj,x) = [6x] for 
each j > 1 because J; is an interval of length ¢;. It follows that N(x) = 
ileal, which is the left-hand side of (26). In light of the first expression 
obtained for the Minkowski content .W in Eq.(15) (and the line following it), 
one then easily deduces Theorem 5.11 (and hence, the MWB conjecture for fractal 
strings, Conjecture 5.9) from Theorem 5.12. Note that |2|; = a £;, so that the 
leading term of N,(x) = ye i[¢; x] in (26) coincides with the Weyl term W(x) 
[given by (21)], as it should. Furthermore, as was alluded just above, by using 
the expression of .@ given by the first equality in (15) and eliminating L? in the 
asymptotic second term of (26), one establishes both the existence and the value of 
the positive constant cp (explicitly given by Eq. (24) of Theorem 5.11). 

Physically, the aforementioned relation, N, (2, -) = er N, Uj, +), follows from 
the fact that the intervals J; comprising the fractal string $2 (or, more poetically, 
the strings of the fractal harp) are vibrating independently of one another. Mathe- 
matically, it follows from the variational formulation of the underlying eigenvalue 
problem; see, e.g., [74] and the relevant references therein (including [120, 146]). 


6 Inverse Spectral Problems for Fractal Strings 
and the Riemann Hypothesis 


It is natural to wonder whether the converse of Theorem 5.11 (or essentially 
equivalently, of the MWB conjecture for fractal strings, Conjecture 5.9) is true. 
Roughly speaking, this means that if there are no oscillations of order D in the 
spectrum of &, then there are no oscillations of order D in the geometry of 2%. 
Rephrased: If N,(x) has an asymptotic second term, proportional to x?, is it true 
that Y is Minkowski measurable? 
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More precisely, given D € (0,1), the inverse spectral problem (ISP)p under 
consideration can be stated as follows: 


(ISP)p If @ is a fractal string of dimension D such that 
N, (x) = W(x) — @ x” + 0(x”) as x > +00, (27) 


for some nonzero (real) constant @, then is it true that LZ is Minkowski 
measurable (or, equivalently, in light of Theorem 3.2 and the comment following 
it, that Nv(x) ~ M x? as x — +00, for some positive constant M, where 
Nv = N¢g(x) denotes the geometric counting function of £)? 


The above problem, (ISP)p, is called an inverse spectral problem since given 
some spectral information about the fractal string “& (namely, the existence for 
N,(x) of a monotonic asymptotic second term, proportional to x”), one asks 
whether one can recover some geometric information about “ (namely, that & is 
Minkowski measurable). Similarly, the MWB conjecture from [74] (Conjecture 5.9 
above) and its resolution given in Theorem 5.11 above (from [94]) fall naturally 
within the class of direct spectral problems. 


Remark 6.13. 


(a) Inthe statement of (ISP)p, it is not necessary to assume that Y is of Minkowski 
dimension D since a result from [94] (recalled in Theorem 3.3 above and 
the comment following it) shows that (27) implies that & is Minkowski 
nondegenerate (i.e.,0 < & < .@* < oo) and hence, has Minkowski 
dimension D. 

Originally, in [86, 87], the error term o(x?) in the counterpart of Eq. (27), was 
assumed to be slightly smaller, namely, O(x? / log! *? x) as x + +00, for some 
56 > 0. However, an unpublished work of Hilberdink (2000, Written personal 
communication to the author, unpublished), using an improved version of the 
Wiener—Ikehara Tauberian theorem (see, e.g., [137]) used in [86, 87], shows 
that the more general estimate o(x?) suffices. (Actually, in [86, 87], the better 
error estimate is only used for the sufficiency part of the proof of Theorem 6.14, 
that is, the part requiring the use of a Tauberian theorem.) 

For Dirichlet boundary conditions (which are assumed here in (ISP)p), one 
must necessarily have @ > O in Eq. (27) because then, we have that N,(x) < 
W(x) for all x > 0, which implies that @ > 0. Note that by hypothesis, @ # 0. 


(b 


we 


(c 


wm 


In [87] (announced in [86]), Lapidus and Maier have shown that this family of 
inverse spectral problems (ISP)p, for D ranging through the critical interval (0, 1), 
is intimately related to the presence of zeros of € = ¢(s) in the critical strip 0 < 
Re(s) < 1 (i.e., to the critical zeros of €), and thereby, to the Riemann hypothesis. 
More specifically, we have the following two theorems (the second one really being 
a corollary of the first one). 


Theorem 6.14 (Characterization of the Nonvanishing of ¢ Along Vertical Lines 
in the Critical Strip [87]). Fix D € (0,1). Then, the inverse spectral problem 
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(ISP)p is true (that is, has an affirmative answer for every fractal string of 
dimension D satisfying the assumptions of (ISP)p) if and only if the Riemann zeta 
function € = €(s) does not have any zeros along the vertical line {Re(s) = D}; i.e., 
if and only the “D-partial Riemann hypothesis” is true. 


Theorem 6.15 (Spectral Reformulation of the Riemann Hypothesis [87]). The 
inverse spectral problem (ISP) p is true for every value of D in (0, 1) other than in 
the midfractal case when D = 1/2 [or equivalently, for every D € (0,1/2)] if and 
only if the Riemann hypothesis is true. 


Theorem 6.15 follows at once from Theorem 6.14 since the Riemann hypothesis 
states that f(s) = 0 with 0 < Re(s) < 1 implies that Re(s) = 1/2 (ie., that s 
belongs to the critical line {Re(s) = 1/2}). The fact that in Theorem 6.15, D can be 
equivalently assumed to be in (0, 1/2), (1/2, 1) or in (0, 1)\{1/2}, follows from the 
functional equation satisfied by € = ¢(s), according to which 


&(s) = €(1—s), forall s € C, (28) 
and hence, 
f(s) =0 6 €(1—s) =0, for 0 < Re(s) < 1. (29) 


Here, § = &(s) denotes the completed Riemann zeta function (or the global zeta 
function of Q, the field of rational numbers), defined by &(s) := 2~°/?I"(s/2)C(s). 
For the standard properties of ¢ and of &, we refer, e.g., to [29, 58, 66, 133, 174]. 
The next result follows immediately from Theorem 6.14 and the well-known 
fact according to which ¢ has a zero (and even infinitely many zeros, according to 
Hardy’s theorem [29, 174], even though it is not needed here) along the critical line 


{Re(s) = 1/2}. 


Corollary 6.16 ([87]). The inverse spectral problem (ISP)p is not true in the 
midfractal case when D = 1/2. 


The following interpretation of Theorem 6.15 and Corollary 6.16 has first been 
proposed in [75, 76] and has since been pursued in a different, but closely related 
context, in [49-53] and in [80], as will be briefly discussed towards the end of 
Sect. 7.4 below. 


Theorem 6.17 (The Riemann Hypothesis as a Mathematical Phase Transition 
[75, 76]). The Riemann hypothesis is true if and only if ISP) p, the inverse spectral 
problem for fractal strings, fails to be true (i.e., fails to have an affirmative answer) 
only in the midfractal case when D = 1/2. 


In [75, 76], Lapidus also wondered (in an open problem) whether the mathemat- 
ical phase transition conditionally (i.e., under RH, and in fact, if and only if RH 
is true) occurring at D = 1/2 could be understood (in a suitable model) as a true 
physical phase transition, of the type occurring in the theory of critical phenomena 
in statistical physics or quantum field theory. He also asked whether the (then) 
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intuitive notion of fractal “complex dimension,” underlying the proof of the key 
part of Theorem 6.14 (as will be explained next), could not be understood as a 
“complexified space-time dimension,” as in Wilson’s theory of phase transitions 
and critical phenomena [182]. 

We now briefly comment on the proof of Theorem 6.14, the central result of 
[87], from which Theorem 6.15 and Corollary 6.16 (as well as Theorem 6.17) 
follow, given the known properties of the Riemann zeta function. First, note that 
Theorem 6.14 consists of two different theorems (and was stated as such in [86, 87]). 
The part of Theorem 6.14 corresponding to a sufficient condition (for (ISP)p to 
be true) is established by using the Wiener—Ikehara Tauberian theorem (see, e.g., 
[137] or [87] for the statement of this theorem). That is, assuming that [for a given 
D € (0,1)], (s) 4 0 for all s € C on the vertical line {Re(s) = D}, one uses 
the aforementioned Tauberian theorem in order to show that the inverse problem 
(ISP)p has an affirmative answer (for all fractal strings of dimension D). On the 
other hand, in order to establish the converse, namely, the fact that the condition 
that €¢(s) ~ 0 for all s € C on the vertical line {Re(s) = D} is necessary for the 
inverse spectral problem (ISP)p to be true (i.e., to have an affirmative answer for all 
fractal strings of dimension D), one uses in [86, 87] in a key (but rigorous) manner 
the intuition (at the time) of complex dimension and its intimate connections with 
asymptotic oscillatory behavior (both in the geometry and the spectrum), along with 
Theorem 3.2 (from [94]).° 

More specifically, we reason by contraposition. Fix D € (0, 1) and assume that 
€(w) = 0, for some w € C such that Re(w) = D. Then, due to the basic symmetry 
of ¢ (namely, €(5) = €(s), where the bar indicates that we are taking the complex 
conjugate of the given complex number), we also have €(@) = 0. Let us write 
o = D+ it, witht € R; so that@ = D — it. Without loss of generality, we may 
assume that t > 0. (Clearly, t 4 0 since €(D) < 0 because 0 < D < 1.) 

Next, for x > 0, let 


U(x) = x? + BO? +2x*) = x7 (1 + 28 cos(z logx)), (30) 


for some coefficient 6 > 0 sufficiently small, and let V(x) := [U(x)], the integer 
part of U(x). It is easy to check that for all 6 small enough, U(x) > 0 and U is 
(strictly) increasing on (0, +00); furthermore, the range of U is all of (0, +00). 
Hence, for such values of 6, given any integer j > 1, we can uniquely define 
£; > 0 such that U(¢;) = j (and thus, V(¢;) = j). In this manner, we define 
a fractal string 2 = (€;)°, with geometric counting function Ny coinciding 
with V : Ny = V. In light of Eq. (30), Ny = V = [U] has sinusoidal (and 
hence, nontrivial periodic) oscillations, caused by the “complex dimensions” w 
and @. Therefore, N.v(x) cannot be asymptotic to x”; equivalently, we do not have 


>It can now also be systematically understood in terms of the generalized explicit formulas of [101, 
Chap. 5]; see [101, Chap. 9]. The resulting theorems and assumptions, however, are somewhat 
different. 
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Nev(x) ~ Mx? asx > +00, for some M > 0. (Note that here, according to (30), we 
would have to have M := 1.) It then follows from Theorem 3.2 (the characterization 
of Minkowski measurability, from [94]) and the comment following it that 7 is not 
Minkowski measurable. Moreover, and using the fact that ¢(w) = €(@) = 0, viaa 
direct computation® it is shown in [87] that Eq. (27) holds for the fractal string Y, 
for some (explicitly known) positive constant @. Consequently, the hypothesis (27) 
of (ISP)p is satisfied but its conclusion (namely, the Minkowski measurability of 7) 
is not; i.e., the inverse spectral problem (ISP)p cannot be true for this value of 
D € (0,1) because it fails to hold for this particular fractal string . This proves 
that if (SP)p is true for some D € (0, 1), we must have ¢(s) 4 0 for all s € C such 
that Re(s) = D, as desired. 

We point out that in the above construction, the geometric oscillations caused by 
the nonreal complex dimensions w and @ of # remain [as is obvious from (30)], 
but due to the fact that by construction, €(w) = €(@) = 0, the spectral oscillations 
[in the asymptotic second term of N,,(x)] disappear. Therefore, we see the subtle 
interplay between complex dimensions, geometric and spectral oscillations, as well 
as the critical zeros of ¢. (Clearly, both w and @ belong to the open critical strip 
0 < Re(s) < 1 since we have that Re(m) = Re(@) = D and D € (0, 1).) 

Finally, we note that by using the theory of complex dimensions developed in 
[99-101], it can be shown that the above fractal “ has exactly three complex 
dimensions (each of which has a real part equal to D and multiplicity one). Namely, 
the set Zy of complex dimensions of # is given by Jv = {D,w, @}. Moreover, 
observe that x® = x? x* and x? = x? x; so that the real part (resp., the 
imaginary part) of the complex dimension w (or @) determines the amplitude (resp., 
the frequency) of the corresponding geometric or spectral oscillations (viewed 
multiplicatively). This statement is now fully corroborated (for any fractal string and 
its complex dimensions) by the rigorous theory of complex dimensions of fractal 
strings and the associated (generalized) explicit formulas developed in [99-101]. 


7 Epilogue: Later Developments and Research Directions 


In this epilogue, by necessity of concision, we very briefly discuss further devel- 
opments closely connected to (or partly motivated by) the results discussed in the 
main body of this paper as well as by related results and conjectures in [74-76]. 
These topics include a geometric interpretation of the critical strip for the Riemann 
zeta function ¢ = ¢(s) (see Sect. 7.1), the theory of complex dimensions of fractal 
strings (see Sect. 7.2), fractal zeta functions and a higher-dimensional theory of 
complex dimensions (valid for arbitrary bounded subsets of Euclidean spaces and 
relative fractal drums (RFDs), see Sect.7.3), quantized number theory and the 


This is now proved in [99-101] by using the (generalized) explicit formulas from [99-101] and 
Eq. (1). 
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spectral operator, along with a functional analytic reformulation of the results of 
[87] discussed in Sect. 6, as well as a different framework (developed in [49-53]) 
and a new asymmetric reformulation of the Riemann hypothesis recently obtained 
by Lapidus in [80] (see Sect. 7.4). 


7.1 Fractal Strings, € = &(s), and a Geometric 
Interpretation of the Critical Strip 


The one-dimensional situation (i.e., the case of fractal strings) is ideally suited to 
the Riemann zeta function € = ¢(s) in the (closed) critical strip 0 < Re(s) < 1, 
as we have seen in Sect.5 and, especially, Sect. 6 above. This is due in part to the 
product formula (1), ¢,(s) = f(s) - Cv(s), connecting the geometric zeta function 
Cy of a fractal string - to its spectral zeta function ¢,,’ combined with the fact that 
fractal strings always have a (Minkowski) dimension between 0 and1:0<D< 1. 

These facts, along with the results of [93, 94] and some of the results and 
conjectures of [74-76], have led to the following geometric interpretation of the 
(closed) critical strip 0 < Re(s) < 1, using the terminology introduced in [74] (in 
any dimension): The least (resp., most) fractal case when D = 0 (resp., D = 1), 
corresponds to the left- (resp., right-) hand side of the critical strip, that is, to 
the vertical line {Re(s) = O} (resp., {Re(s) = 1}. Furthermore, the midfractal 
case when D = 1/2 corresponds to the critical line, namely, the vertical line 
{Re(s) = 1/2} where (according to the Riemann hypothesis) all of the nontrivial 
(or critical) zeros of € = C€(s) are supposed to be located. This geometric picture 
of the critical strip has later been corroborated by the work in [86, 87] described 
in Sect. 6 above. Its complete and rigorous justification has then been provided by 
the theory of complex dimensions for fractal strings (that is, in the one-dimensional 
situation) developed in [99-101]. (See, in particular, [101, Chaps.9 and 11].) We 
will next briefly describe (in Sect. 7.2) a few aspects of the latter theory. 


7.2 Complex Dimensions of Fractal Strings 
and Oscillatory Phenomena 


The theory of complex dimensions of fractal strings, developed by Lapidus and van 
Frankenhuijsen in the research monographs [99-101] (and several corresponding 
articles), aimed originally at obtaining a much more accurate understanding of the 
oscillatory phenomena which are intrinsic to fractals (in their geometries and their 


7We note that the product formula (1) for the spectral zeta functions of fractal strings has since 
been extended in various ways in the setting of Laplacians on certain self-similar fractals; see 
[26, 72, 73, 172, 1731. 
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spectra as well as in the underlying dynamics). This is accomplished via explicit 
formulas (generalizing Riemann’s original 1858 explicit formula and its extensions 
to various aspects of number theory and arithmetic geometry, see [24, 29, 58, 131- 
133, 157, 174] along with [101, §5.1.2 and §5.6]) expressed in terms of the 
underlying complex dimensions; see [101, Chap. 5]. The latter complex dimensions 
are defined as the poles of a suitable zeta function, typically a geometric, spectral, 
dynamical, or arithmetic zeta function. (See [101].) 

In the case of fractal strings, which is the main focus of the theory developed 
in [99-101], these explicit formulas can be applied, for example, to the geometric 
counting function Ny(x), the spectral counting function N,(x), the volume of 
tubular neighborhoods V(e) (giving rise to a “fractal tube formula’), or to the 
counting function of the number of primitive geodesics of an underlying dynamical 
system. (See [101, Chaps. 6—11].) In the special case of self-similar strings, the 
resulting fractal tube formulas give very precise information, in part due to the 
knowledge of the periodic (or, in general, the quasiperiodic) structure of the complex 
dimensions (see [101], Chaps. 2—3 and Sect. 8.4). 

For a fractal string &, under mild growth assumptions on the associated 
geometric zeta function Cy (see [101, §5.1]) and assuming that all the complex 
dimensions (i.e., the poles of ¢y) are simple, the fractal tube formula for V(¢) = 
V.v(e) takes the following form (see [101, Chap. 8]): 


1-—w 
V(e) = 0, as ay TRO (31) 


wED 


where Y = Dy denotes the set of (visible) complex dimensions of Y and a, := 
res(€v,w) for each w € Y. Furthermore, the error term R(e) is precisely estimated 
in [101]. In the important special case of self-similar strings (which includes the 
Cantor string discussed just below), one can take R(e) = 0 and therefore obtain an 
exact fractal tube formula (which holds pointwise); see [101, $8.4]. 


Remark 7.18. 


(a) The explicit formula is valid for multiple poles as well but must then take 
a different form, in general. Namely, we then have (see [101, Theorems 8.1 
and 8.7]) 


c 2 1—s 
Ve) = Sores b2(9) Ce) ) + RE). (32) 
s(1—s) 
wED 
(b) The fractal tube formulas (31) and (32) hold pointwise or distributionally, 


depending on the growth assumptions made about ¢¥; see [101], Sect. 8.1, 
especially, Theorems 8.1 and 8.7 and their corollaries. Also, in the so-called 
strongly languid case (in the sense of [101, Definition 5.3]), we can let R(e) = 0 
and therefore obtain an exact formula. This is the case, for example, for all self- 
similar strings; see [101, §8.4]. 
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(c) For the first higher-dimensional analog of (31) and (32), see [99-101] for 
specific examples of fractal sprays (in the sense of [95]), and for a fairly general 
class of fractal sprays and self-similar tilings (or, less generally, sets), see 
[91, 92, 104, 105]; see also [134, 135] and, for a direct approach in the case of 
the Koch snowflake curve [90]. Within the general higher-dimensional theory of 
complex dimensions developed in [108-113, 115, 116], the precise counterpart 
of (31) and (32) is provided in [108, 113] and [116, Chap. 5]; see Sect. 7.2 for a 
brief discussion. 


Let us now specialize the above discussion to the Cantor string & = CS, viewed 
geometrically as the bounded open set §2 C R, defined as the complement of the 
classic ternary Cantor set in [0, 1]; hence, 02 is the Cantor set. Then 2 = CS = 
(€)) 1, with Ly => 1/3, £2 => £3 => 1/9, £4 => Ls => £6 => Ly => 1/27, --- < 
Alternatively, the lengths of the Cantor string (or harp) are the numbers 3~"~! 
repeated with multiplicity 2”, form = 0,1,2,---. Then 


= 


3 
fcs(s) = 7-2.3-8° for all s € C; (33) 


so that the set Ys of complex dimensions (here, the complex solutions of the 
equation 1 — 2-37-* = 0) is given by 
Des = {D + inp Ine Zh, (34) 


where D := log, 2 = log 2/ log 3 (the Minkowski dimension of CS or, equivalently, 
of the ternary Cantor set) and p := 2z/1log3 (the oscillatory period of CS). 
Then (31) (with R(x) = 0) becomes 


1 (2e)!-P-inp 
Ves(€) = ag 35 
cs(©) = F583 dX (D + inp)(1—D—inp) ~~ G9) 


e!P G(loge !) —2e, 


II 


where G is a periodic function which is bounded away from zero and infinity. One 
then recovers the result from [93, 94] according to which CS (and hence also, 
the Cantor set) is Minkowski nondegenerate but is not Minkowski measurable. 
(Actually, the values of .#%, and .@* are computed explicitly in [94] as well as, 
in a more general context, in [101, Chap. 10].) 

Moreover, the geometric counting function of CS is given by 


xDtinp 
-—1 
D+ inp 


1 
Nes(x) = > (36) 


2 log 3 “ot 
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while the corresponding frequency counting function is given by 


Nyes) = 3+ 5 OD + inp) = 


a O(1) as x > +00. (37) 
210 og 3 neZ 


(See [101], Sect. 1.1.1, Sect. 1.2.2 and Chap. 6.) 

In (31) and (35)-(37), we see the intuitive (and actual) meaning of the complex 
dimensions. Namely, the real (resp., imaginary) parts correspond to the amplitudes 
(resp., frequencies) of the oscillations [in the spaces of scales, for (31) and (35)— 
(36), and in frequency space, for (37)]. 


Remark 7.19 (Fractality and Complex Dimensions). The notion of fractality is 
notoriously difficult to define. Mandelbrot [121] has proposed to define a fractal as a 
geometric object whose Hausdorff dimension is strictly greater than its topological 
dimension. There is an obvious problem with this definition (Mandelbrot was 
aware of it, as is stated in his book, [121, p. 82]). Namely, the Cantor curve (or 
“devil’s staircase”) is not fractal in this sense (since its Hausdorff, Minkowski, and 
topological dimensions are all equal to 1); however, as Mandelbrot states in [121], 
everyone would agree that the Cantor curve should be called “fractal”. This issue 
has long preoccupied the present author. 

There is, however, a satisfactory way to resolve this apparent paradox as well 
as many other related and unrelated issues. In the theory of complex dimensions 
developed in [99-101], an object is called “fractal” if it has at least one nonreal 
complex dimension (with positive real part). (See [101, §12.1].) Accordingly, the 
Cantor curve (CC) is, indeed, fractal because its set of complex dimensions is 
given by Yoo = Bes U {1}, where Yes is the set of complex dimensions of the 
Cantor set (or, equivalently, the Cantor string) given by (34). (Hence, Ycc has 
infinitely many nonreal complex dimensions with positive real part.) Furthermore, 
(nontrivial) self-similar strings (and, more generally, self-similar geometries) are 
fractal. In order to accommodate random (and, in particular, stochastically self- 
similar fractals), as in [44] and the relevant references therein, the above definition 
of fractality has been extended to allow for a “fractal” A to be such that its associated 
zeta function ¢, has a natural boundary along a suitable curve of the complex plane, 
called a “screen” in [101, §5.1] (and hence, such that ¢4 cannot be meromorphically 
extended beyond that curve). In [108-113, 115, 116], such a compact subset of 
RY is called a “hyperfractal.” In [110, 112, 116] are even constructed “maximal 
hyperfractals”; that is, compact subsets A of R“ (where N > 1 is arbitrary) such that 
€, has anonremovable singularity at every point of the “critical line” {Re(s) = D}, 
where D is the Minkowski dimension of A. 


Remark 7.20 (Noncommutative Riemann Flow of Zeros and the Riemann Hypoth- 
esis [79]). In the long term, the theory of complex dimensions aims at unifying 
many aspects of fractal and arithmetic geometries. Many concrete and more abstract 


8 An example of a trivial self-similar set is an interval of R or a cube in RY (N > 2). In such cases, 
all of the complex dimensions are easily seen to be real; see [101, 112, 113] and [116]. 
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examples are provided in [101]. This direction is pursued in many different 
directions in the author’s book, In Search of the Riemann Zeros [79], where is 
introduced the notion of fractal membrane (i.e., quantized fractal string) and the 
associated moduli spaces of fractal membranes (as well as of fractal strings). In 
[79], a still conjectural (noncommutative) flow on the moduli space of fractal 
membranes and correspondingly, a flow of zeta functions (or partition functions) 
and a flow of zeros are used in an essential manner in order to provide a possible 
new interpretation of (and approach to) the Riemann hypothesis. 

Accordingly, the flow of fractal membranes can be viewed as some kind of 
noncommutative Ricci flow which is transforming (as “time” tends to infinity or 
physically, as the absolute “temperature” tends to zero) generalized quasicrystals 
into (self-dual) pure crystals. Correspondingly, the zeta functions are becoming 
increasingly symmetric (that is, satisfy a true functional equation, in the limit), 
while the zeros (of these zeta functions) are pushed under this flow (viewed as 
a flow on the Riemann sphere) onto the Equator, which naturally represents the 
critical line in this context. Conjecturally, this would also provide a proof of the 
generalized Riemann hypothesis (GRH), valid for all of the number theoretic zeta 
functions for which the analog of the Riemann hypothesis is expected to be true. 
(See, e.g., [131, 132, 157] and [101, Appendix A] or [79, Appendices B, C, and 
E].) Very concisely, this is the interpretation of (and approach to) the Riemann 
hypothesis proposed in [79]. Needless to say, it poses formidable mathematical 
and physical challenges but may nevertheless stimulate further investigations of 
the aforementioned (noncommutative) Riemann (or “modular’’) flow, whether it be 
viewed as a (noncommutative) Ricci-type flow or, physically, as a renormalization 
flow (not unlike the one which presumably describes the time evolution of our 
universe). 


7.3. Fractal Zeta Functions of Arbitrary Compact Sets 
and Higher-Dimensional Theory of Complex Dimensions 


In a forthcoming book by Lapidus, Radunovié and Zubrinié, entitled Fractal Zeta 
Functions and Fractal Drums [116] (see also the series of research and survey 
papers [108-113, 115]), is developed a higher-dimensional theory of complex 
dimensions, valid for any bounded subset of N-dimensional Euclidean space R™, 
with N > 1 arbitrary. Distance and tube zeta functions are defined in that general 
setting. Let D be the (upper) Minkowski dimension of a given (nonempty) bounded 
subset A of R”. Then it is shown in [110, 116] that the distance zeta function of A, 
denoted by €4 = €4(s) and defined (for all s € C with Re(s) sufficiently large and 
for a given 6 > 0) by 


aii= | d(x, A)’ dx (38) 


is holomorphic for Re(s) > D. 
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Remark 7.21. 


(a) This new type of fractal zeta function, €4, was introduced by the author in 2009 
in order to be able to extend to any higher dimension N the theory of complex 
dimensions of fractal strings developed in [99, 100] (and now also in [101]). The 
case of fractal strings corresponds to N = 1; in that case, the precise relationship 
between Cy and €,, with A := 0.2, is explained in [110, 116]. 

(b) For the simplicity of the discussion, we assume here that |A| = 0 (i.e., A is a 
Lebesgue null set), which is the case of most fractals of interest. We refer to 
[116] for a discussion of the general case. 


A first key result of the theory is that the abscissa of convergence of this 
Dirichlet-type integral (viewed as a Lebesgue integral) or, equivalently, the abscissa 
of (absolute) convergence of €4, coincides with D: 


o =D, (39) 


the (upper) Minkowski (or box) dimension of A. Therefore, {Re(s) > D} is the 
largest open right half-plane (of the form {Re(s) > a}, for some a € RU{+o00}), on 
which the Lebesgue integral appearing on the right-hand side of (38) is convergent 
(i.e., absolutely convergent). This result is the higher-dimensional counterpart of 
Theorem 2.1 above (first noted in [75, 76] in the case of fractal strings, i.e., when 
N = 1). The proof of Eq. (39) makes use of an interesting integral estimate obtained 
in [46] in order to study the singularities of solutions of certain linear partial 
differential equations. (See also [183].) 

Moreover, under mild assumptions, namely, if D < N (recall that we always have 
0<D<WN), MP > 0 and the Minkowski dimension of A exists (i.e., the lower and 
upper Minkowski dimensions of A coincide), then {Re(s) > D} is also the maximal 
open right-half plane to which €, = €,(s) can be holomorphically continued; i.e., 
D coincides with Djoj(,4), the abscissa of holomorphic continuation of €,: 


D=o = Dnoi(Ea), (40) 


where, as above, D = D(A) and 0 = Dgos(Ca). 
An entirely analogous theorem holds for the tube zeta function of A, denoted by 
€4 = €,(s) and defined by 


Z 3 d 
Ex(s) = / All, (41) 


for all s € C such that Re(s) > D. 7 

In fact, it can be shown that the two functions, C4 and (N — s) €,, differ by 
an entire function, from which it follows that (for D < N), the two fractal zeta 
functions €4 and ¢4 have exactly the same qualitative properties. In particular, given 
a domain U C C containing {Re(s) > D}, the common half-plane of (absolute) 
convergence of €4 and f4, €4 can be meromorphically extended to U if and only if 
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it is the case for é ‘4. Furthermore, 4 and é ‘4 have exactly the same poles, in U (with 
the same multiplicities). These poles are called the (visible) complex dimensions 
of A. Moreover, the residues of €4 and c ‘4 at a simple pole (or, more generally, the 
principal parts at a multiple pole) are related in a very simple way. For example, if 
Dis simple, then we have 


res(€4,D) = (N—D) res(€,,D). 


Finally, if €4 (or, equivalently, é 4) can be meromorphically continued to an open 
and connected neighborhood of D, then D is a simple pole of ¢4 and ¢, (ie., it is a 
complex dimension of A)’ and the residue of ¢, at s = D is squeezed between the 
lower and upper Minkowski contents of A: 


Ms < res(&4;D) < M*. (43) 


If, in addition, A is Minkowski measurable (with Minkowski content denoted by .#, 
as before), then 


res(€4; D) =U. (44) 


Both formulas (43) and (44) are valid even if D = N, although the justification 
of this statement requires a specific argument; see [116]. 

We should note that, in light of the aforementioned functional equation connect- 
ing ¢4 and ¢,, the choice of 6 is unimportant in the definition of ¢4 and ¢4 (in (38) 
and (41), respectively). Furthermore, the residues of €4 and ¢,4 at (simple) complex 
dimensions are independent of 6. 

In the special case when N = | (that is, for fractal strings), it is shown (still in 
[110, 115, 116]) that the distance zeta function C, and the geometric zeta function 
¢ are closely related (here, A := 0 and & is the fractal string associated with 
the bounded open set 2 C R). In fact, given any subdomain U of {Re(s) > 0} 
(or, more generally, of C\{0}), &y has a meromorphic continuation to U if and only 
if €,4 = faq does. Consequently, ¢y and C4 have the same poles (with the same 
multiplicities) in U; i.e., they have the same visible complex dimensions. This is 
true independently of the geometric realization 2 of the fractal Y = (€))~ as a 
bounded open subset of R. In particular, we recover Theorem 2.1. 

A variety of results are provided in [108-113, 115, 116], guaranteeing the 
existence of a suitable meromorphic extension of €, (and of €,4) beyond the 
half-plane of convergence {Re(s) > D}. Moreover, many examples of computation 


°Since ¢4 and €, are holomorphic in the open half-plane {Re(s) > D}, it then follows that 
D = max{Re(s): s € J}, (42) 


where ZY is the set of (visible) complex dimensions of A in any domain U containing this 
neighborhood of {D}. 


Fractal Strings and the Riemann Hypothesis 231 


of the complex dimensions of a variety of fractals (when N = 1,2 or N > 3) 
are provided throughout [108—113, 115, 116]. These fractals include (one- or two- 
parameter) families of generalized Cantor sets, the Sierpinski gasket and carpet, a 
relative fractal drum counterpart of the Menger sponge (a three-dimensional analog 
of the Sierpinski carpet, see, e.g., [121]), families of spirals, fractal curves such as 
the Cantor curve (or devil’s staircase), which is examined from several points of 
view, as well as geometric chirps and curves with cusps. The theory is extended 
to a new class of objects, called RFDs, which allow a much greater flexibility in 
many situations and generalize both the usual fractal drums (i.e., drums with fractal 
boundary, in the sense, for example, of [74-76], as discussed in Sect. 4 above) and 
the class of bounded subsets of Euclidean space R. It is noteworthy that in the 
case of RFDs, the relative Minkowski dimension can be negative; it can even take 
the value —oo, as is shown in [109, 112, 116], where geometric explanations are 
provided for this “dimension drop” phenomenon. 

In short, a RFD is a pair (A, $2), with A an arbitrary (possibly unbounded) subset 
of R% and @ an open subset of R” with finite volume (i.e., |2|y < oo) and such 
that 2 C As, for some 6 > 0. The special case of an ordinary fractal drum (or 
“drum with fractal boundary,” in the sense of [74—76]) corresponds to the case where 
A = 02 and &2 is as above. 

Furthermore, the main results of [74] (see Sect.4.1 above, especially, Theo- 
rem 4.8) are used in an essential manner in order to show that the spectral zeta 
function of a fractal drum (that is, of the Dirichlet Laplacian on a bounded open 
subset of IRY, with N > 1) can be meromorphically extended to (at least) the open 
half-plane {Re(s) > D}, where (as before) D is the (upper) Minkowski dimension 
of the boundary 02 of the drum. [This was already observed in [76], using [74] 
and a well-known Abelian-type theorem (in the sense of [168]), the converse of a 
Tauberian theorem. It can also be deduced from the main result of [74] by a direct 
argument, based on the holomorphicity of an integral depending analytically on a 
parameter.] Moreover, using the construction of a maximal hyperfractal!° carried 
out in [109, 110, 116] (and alluded to towards the end of Remark 7.19 above), it is 
shown that the half-plane {Re(s) > D} is optimal, in general; i.e., it cannot usually 
be larger, from the point of view of the meromorphic continuation of the spectral 
zeta function. We note that under suitable assumptions, the Dirichlet Laplacian can 
be replaced by a higher order positive, self-adjoint elliptic operator, with possibly 
variable coefficients and with Dirichlet, Neumann or mixed boundary conditions; 
see [74, 109, 116]. 

Extensions of the results of [108—113, 115, 116] to unbounded sets, in particular, 
are provided in [140]. 


‘ORecall that in the terminology of [108-113, 115, 116], a “maximal hyperfractal” is a compact 
subset A of R™ such that ¢, has a nonremovable singularity at every point of the “critical line” 
{Re(s) = D}; in addition to Remark 7.19, see Remark 7.22 below. 
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Finally, we note that €, and é ‘4 remain unchanged if we replace the bounded set 
A C R\ by its closure A. As a result, we could have assumed throughout that A 
was a compact subset of R”. Moreover, we point out that fractal tube formulas, 
significantly extending to compact subsets of R” the corresponding (pointwise 
and distributional) tube formulas obtained in [101, §8.1-8.3] for fractal strings (as 
well as the later tube formulas obtained for fractal sprays in [91, 92, 104, 105]), 
are established in [108, 113] and [116, Chap. 5], without any assumptions of self- 
similarity and in every dimension N > 1. 


Remark 7.22. We close this discussion by mentioning the fact that the higher- 
dimensional theory of fractal zeta functions and the associated complex dimensions 
now enables us to extend to the general (higher-dimensional) setting the definition 
of fractality introduced in [99-101] (see, especially, [101, §12.1 and §12.2)). 
Accordingly, a geometric object is said to be “fractal” if it has at least one nonreal 
complex dimension (with a positive real part). In particular, one can now precisely 
talk about the fractality of any bounded subset A (or, more generally, RFD) in R, 
with N > 1 arbitrary. (Compare with Remark 7.19 above.) 

As an example, the Cantor curve (i.e., the “devil’s staircase” in the terminology 
of [121]) is fractal according to this definition, whereas it is not fractal according 
to Mandelbrot’s original definition in [121] (because its Hausdorff, box and 
topological dimensions coincide and are equal to one). (Recall from Remark 7.19 
that in [121], a geometric object is said to be “fractal” if its Hausdorff dimension 
is strictly greater than its topological dimension.) One can also check that most of 
the classic “fractals” (for example, the ternary Cantor set and its generalizations, as 
well as the Sierpinski gasket and carpet and their higher-dimensional counterparts) 
are indeed fractal in this new sense. (See [109, 112, 113, 116] for these and many 
other examples.) 

A technical challenge remains to prove this same result for all (suitable) self- 
similar sets (as was done when N = 1 for self-similar strings in [99-101] and when 
N > 2 for a large class of self-similar sprays or tilings in [91, 92, 104, 105]),!! as 
well as for the classic types of fractals occurring in complex dynamics (Julia sets, the 
Mandelbrot set and their generalizations; see, e.g., [118, 121, 167]) and in conformal 
dynamics and/or hyperbolic geometry (for instance, limit sets of Fuchsian groups 
and Kleinian groups; see, e.g., [5]), possibly by using other gauge functions than 
the usual ones based on the standard power laws, as was originally done in [47] and 
further studied in [108—113, 115, 116] (from the point of view of the new higher- 
dimensional theory of complex dimensions). It may be difficult to do so, but there 
is no doubt in the author’s mind that all of the classic (deterministic) fractals will 
eventually be found to be “fractal” in the above sense or “hyperfractal,” in a sense 
to be explained next. 


'l Added note: In the case of self-similar sprays, the results of [108, 112, 113, 116] now enable one 
to recover and significantly extend the results of [91, 92, 104, 105]. 
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Due in part to the work in [44] on random fractal strings and their complex 
dimensions, the notion of fractality was extended as follows (first in [100] and [101, 
§13.4.3] and then, in any dimension, in [108—113, 115, 116]). A geometric object is 
said to be “fractal” if it has at least one nonreal complex dimension with a positive 
real part (as above) or else if it is “hyperfractal”; i.e., if the associated fractal zeta 
function has a natural boundary along some suitable contour in C (a “screen,” in 
the sense of [101, §5.3]). We note that the term “hyperfractal” was introduced in 
[108-113, 115, 116], in this context. 

Therefore, a “hyperfractal” is such that the associated fractal zeta function cannot 
be meromorphically extended beyond a certain “screen.” Furthermore, in [109- 
112, 115, 116] is also introduced the notion of “maximal hyperfractal,” according 
to which the corresponding fractal zeta function has a nonremovable singularity 
at every point of the critical line of convergence {Re(s) = D}, where D is the 
Minkowski dimension of A C RX (or its relative counterpart, in the case of an RFD). 
It is then shown in ({109-112] or [116, Chap. 4]) using, in particular, countably 
many suitable fractal strings assembled in an appropriate way, along with Baker’s 
theorem from transcendental number theory [4], that maximally hyperfractal strings 
(N = 1), as well as maximally hyperfractal compact subsets and RFDs of RY 
(for every N > 1) can be explicitly constructed. This construction is completely 
deterministic. The author conjectures (building on [44] and [101, §13.4.3]) that for 
large classes of random fractals, maximal hyperfractality is an almost sure property. 

In closing this remark, we mention that the above definition of fractality cannot 
just be applied to standard geometric objects embedded in Euclidean spaces (or, 
more generally, in appropriate metric measure spaces) but is also applicable, in 
principle, to ‘virtual’ geometries, spectral geometries, dynamical systems, algebraic 
and noncommutative geometries (not necessarily consisting of ordinary points; see, 
e.g., [19, 22]), as well as arithmetic geometries. In fact (along with the proper notion 
of zeta function and the associated complex dimensions), it should be used as a 
unifying tool between these apparently vastly different domains of mathematics. 
This long-term goal has been one of the central motivations of the author (and his 
collaborators) in [20, 30, 47, 74-85, 87, 88, 94, 98-101, 104, 116] (particularly, in 
[75, 76, 78, 80-82], as well as in the books [79, 99-101]). 

For instance, the author conjectures that there exists a natural fractal-like geo- 
metric object whose complex dimensions are precisely the critical (i.e., nontrivial) 
zeros of the Riemann zeta function. Furthermore, the essential “shape” of this 
object should be understood in terms of a yet to be constructed cohomology theory 
associated with the underlying complex dimensions (and the pole of € = €(s) at 
s = 1). In particular, connections with Deninger’s work (and conjectures) in [24, 25] 
arise naturally in this context. (See, especially, [79, 81, 82], along with [101, §12.3 
and §12.4].) Significant progress along those lines has recently been made by the 
author in [81, 82]. 
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7.4 Quantized Number Theory, Spectral Operators 
and the Riemann Hypothesis 


Formula (1), ¢,(s) = €(s) - Cv(s), which connects the spectral zeta function ¢, = 
¢, y and the geometric zeta function ¢ of a fractal string & via the Riemann zeta 
function ¢, has the following counterpart, in terms of the spectral and geometric 
counting functions of Y (see [101, Theorem 1.2]): 


na) = OMe (): (45) 


n=1 


for any x > 0. Note that for a fixed x > 0, only finitely many terms are nonzero 
on the right-hand side of (45). However, the number of these terms tends to +00 as 
X —> +00. 

The (heuristic) spectral operator, at the level of the counting functions, is then 
given by the map 


Cc 
g:= Ne Nlg):= de (-): (46) 

n=1 
that is, N,(g)(x) := °°, g(x/n), for all x > 0. It can therefore be thought of as the 
map sending the geometry [represented by Ny, given by Eq. (11)] of a fractal string 


£ onto the spectrum (represented by N, = N,,.) of a fractal string &. Here, as in 
the discussion preceding Eq. (13), 


N,(x) = Hf € o(L) if x}), (47) 


for any x > 0, where o(.2) = {n- i :n>= 1,j = 1} is the (frequency) spectrum 

of 2 = (€)%). 

Remark 7.23. Atan even more fundamental level (that of the “density of geometric 

states” 7 := ane byp—1y and the “density of spectral states” v := Vfeo(#) dsp}, SEC 
J 

[101, §6.3.1]), the spectral operator can be viewed as the map sending the geometry 


(represented by 7) of a fractal string “ = (€;)?*, onto its spectrum (represented 
by v): 


co 


roe) 
qi= be; 4 rev= v(7) = Binet, (48) 
i=l _ 


jn= 


where 6;,, denotes the Dirac point mass at y > 0 and the “generalized fractal strings” 
n and v are viewed as positive (local) measures (or as tempered distributions) on 
(0, +00); see [101, §6.3.2]. We also refer to [101, §6.3.1] for explicit formulas 
expressing 7 and v in terms of the underlying complex dimensions of #, along 
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with [101, Chap. 4] for the notion of generalized fractal string, based on the notion 
of local measure from [27, 63, 64] and [116, Appendix A]. 


Viewed additively (that is, after having made the change of variable x = e’, t = 
log x, with x > 0 and ¢ € R), the (heuristic) spectral operator, denoted by a, becomes 
the following map (where f = f(t) is a suitable function of the new variable t € R): 


SO alf)() = Yo f(t—logn). (49) 


n=1 


As before, let Y denote the set of all prime numbers. Then, given any p € Y, 
the associated (local) Euler factor a, is given by the operator 


fO af) = Yo f(t— mlogp). (50) 


m=0 


The spectral operator a and its (operator-valued) Euler factors a, (with p € #) are 
connected via the following (operator-valued) Euler product representation of a: 


a(f)() =| [] A] ©. (51) 


pe? 


with a, = (1 — p~°)~!. Here, we have let 0 = d/dt denote the differentiation 
operator (also called the infinitesimal shift of the real line). We have (for any  € R) 


(o™)AW =f(t—A); (52) 


in particular, we have 


(n")(f)(t) = f(t — logn), (53) 


so that we should also expect to be able to obtain the following (operator-valued) 
Dirichlet series representation of a: 


foe) 


a(f)(t) = (>: ~) (f)(t). (54) 


n=1 


All of these semi-heuristic formulas and definitions are given without proper 
mathematical justification in [101, §6.3.2] (and, originally, in [100, §6.3.2] where 
the heuristic spectral operator a was first introduced). However, in [49-53], Herichi 
and Lapidus have developed a rigorous functional analytic framework within 
which (under suitable assumptions) all of these “definitions” and formulas are 
properly justified and, among many other results, the reformulation of the Riemann 
hypothesis obtained in [87] (see Theorem 6.15 above) can be restated in operator 
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theoretic terms; namely, as the “quasi-invertibility” of the spectral operator in all 
possible dimensions, except in the midfractal case 1/2. 

We refer the interested reader to the forthcoming book [53], titled Quantized 
Number Theory, Fractal Strings and the Riemann Hypothesis, for a complete 
exposition of the theory. We limit ourselves here to a brief exposition and a few 
additional comments. 

For each fixed c € R, let H, = L?(R,e~?“dt),'* and let 0 = d/dt be the 
unbounded (and densely defined) operator acting on the Hilbert space H, via 
a(f) = f’, the distributional (or else, the pointwise almost everywhere defined) 
derivative of f, for all f in the domain D(0) of a: 


D(0) := {f € Caps(R) NH, : f’ € He}, (56) 


where C,»;(R) is the space of (locally) absolutely continuous functions on R (see, 
e.g., [21, 35, 151]). Then, the infinitesimal shift 0 = 0, is shown in [49, 50, 53] to 
be an unbounded normal operator on H, (that is, d*d = 00*, where 0* denotes the 
adjoint of the closed unbounded operator 0; see, e.g., [67] or [152]), with spectrum 
o(0) = {Re(s) = c}. Furthermore, the associated group {e~"h € R is shown to be 
the shift group given (for all f € Hl.) by (52) while n~ is therefore given by (53). 

Moreover, for every c > 1, the quantized (i.e., operator-valued) Euler product 
and Dirichlet series representations (51) and (54) of a = a, hold, in the following 
strong sense: a = )°°, n° anda = Teg — p~°)—!, where both the series 
and the infinite product converge in @(H_,), the Banach algebra of bounded linear 
operators on the Hilbert space H,. (See [52] or [53, Chap. 7].) 

In addition, for all f in a suitable dense subspace of D(0) (and hence, of H.), the 
spectral operator a = a,, now rigorously defined (for every c € R) by the formula 
a := C(d), is given (for c > 0) by an appropriate operator-valued version of the 
classic analytic continuation of the Riemann zeta function € = ¢(s) to the open 
half-plane {Re(s) > 0}. Namely, formally, we have 


1 0 n+1 z= - 
a=t@=>— +) f (x? — rat. (57) 


n=1°" 


(Compare, for example, with [165, §VI.3] or the proof of Theorem 5.12 above given 
in [94].) On this dense subspace, it also coincides with a quantized (and convergent) 
version of the Euler product and the Dirichlet series. 


Hence, Hl, is the complex Hilbert space of (Lebesgue measurable, complex-valued) square 
integrable functions with respect to the absolutely continuous measure e~*“dt, and is equipped 
with the norm 


1/2 
lille 2= ( [vor ar) 206 (55) 


and the associated inner product, denoted by (-, -)¢. 
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Similarly, the global spectral operator of := &(0), where, as before, &(s) := 
m~‘/?"(s/2) f(s) is the global (or completed) Riemann zeta function, is shown 
to be given by an operator-valued (or quantized) version of the standard analytic 
continuation of § = &(s) to the entire complex plane (see, e.g., [174, §2.6] or [79, 
§2.4]). Again, we refer the interested reader to [52] or [53, Chap. 7] for the precise 
statements and many related results. 

We note that the spectral operator a = a,, now defined for any c € R bya := 
€(d), is obtained via the functional calculus for unbounded normal operators (see 
[152]); so that, formally, one substitutes the infinitesimal shift 0 for the complex 
variable s in the usual definition of the Riemann zeta function (or, rather, in its 
meromorphic continuation to all of C, still denoted by ¢ = ¢(s), for s € C). 

As it turns out, according to the celebrated spectral theorem (for unbounded 
normal operators, see [152]), what matters are the values of € = ¢(s) along the 
spectrum o (0) of 0; that is, according to a key result of [49, 53], along 


o(0) = cl(({s € C:: Re(s) = c}), (58) 


where c£(G) denotes the closure of G C Cin C. 

We point out that for c = 1, the unique (and simple) pole of ¢, we must assume 
that s 4 1 on the right-hand side of (58). Alternatively, one views the meromorphic 
function ¢ as a continuous function with values in the Riemann sphere C:=Cu 
{coo}, equipped with the standard chordal metric. (We set (1) = oo.) Then, even for 
c = 1, the extended spectrum (a) of a is given by the right-hand side of (58), but 
without the closure, where O(a) := o(a)U {co} if ais unbounded and a (a) := o(a) 
if a is bounded; that is, according to [49-53], if c < 1 orifc > 1, respectively. 

One therefore sees how the properties of the Riemann zeta function € = ¢(s) 
(for example, its range along the vertical line {Re(s) = 1} or the existence of a 
pole at s = 1) are reflected in the properties of the spectral operator a = €(0) (for 
example, whether or not a is bounded or invertible or, equivalently, whether or not 
its spectrum o(a) is compact or does not contain the origin, respectively). 

Formula (58) for o(a) follows from a suitable version of the spectral mapping 
theorem (SMT) for unbounded normal operators (see [53, Appendix E]) according 
to which a (€(0)) = c€(¢(a(0))\{1}) or, equivalently, o(€(0)) = ¢€(0(0)) (where 
in the latter formula, ¢ is viewed as a continuous C-valued function, as explained 
above). Note that for c # 1, we have that 1 ¢ o(0), so that € = €(s) is holomorphic 
and hence, continuous along o(0) = {Re(s) = c}, whereas forc = 1, € = C(s) 
is meromorphic but has a singularity (more specifically, a simple pole) ats = 1 € 
a (0). Consequently, we can use the continuous (resp., meromorphic) version of 
SMT (stated and proved in [53, Appendix E]) in order to deduce that when c ¥ 1 
(resp., c = 1), the identity (58) (resp., the counterpart of (58) when c = 1) holds. 

Next, given T > 0, one defines (still via the functional calculus for unbounded 
normal operators) a truncated infinitesimal shift 3 = a.” (say, 9 := o™(d), 
where y™ is a suitable cut-off function), so that (again by the aforementioned 
SMT), o(0™) = [ce — iT,c + iT], where c € R andi := V—1. (We refer 
to [49, 50, 53] for the precise definitions; see also Remark 7.24 just below.) 
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Then, letting a” := (3), we obtain the truncated spectral operator a = a”. 


We say that the spectral operator a is quasi-invertible if each of its truncations a” 
is invertible (in the usual sense, for bounded operators), for every T > 0. (Note that 
the normal operator 0 is bounded since its spectrum is compact.) 


Remark 7.24. For c # 1, the cut-off function gp : o(0) = {Re(s) = c} > C 
is assumed to satisfy the following two conditions: (i) g” is continuous and, in 
addition, (ii) cL(g@™ ({Re(s) = c})) = [c —iT,c + iT]). For c = 1, we replace (i) 
by (i’) : ¢™ has a (necessarily unique) meromorphic continuation to a connected 
open neighborhood of {Re(s) = 1}. (Then, condition (i’) and (ii) imply that (i) also 
holds.) Consequently, when c # | (resp., c =1), one can use the continuous (resp., 
meromorphic) version of the SMT (see [53, Appendix E]) in order to deduce that 


o(8) = o(g (A) (59) 
= cl(y™ ({Re(s) = c})) = [c- iT, c + iT], 
and especially, that 
a(a) = o(f(8)) (60) 
= cl(E(0(8™))) = cE (le — iT, ¢ + iT), 
where for c = 1, one should remove {1} from o(0™) = [c — iT,c + iT] in the 


last two equalities of (60), while when c ¥ 1, (60) takes the following simpler form 
(since then, ¢ is continuous along the vertical line segment [c — iT, c + iT]): 


o(a™) = &([c —iT,c + iT]). (61) 


When c = 1, (61) can be replaced by the following identity, which is equivalent 
to (60) interpreted as above (since €(1) = {00} and ¢(s) 4 coforalls e C,s 41): 


&(a™) := o(a™) U {oo} = E([1 — iT, 1 + iT]), (62) 


where (as in an earlier comment) the meromorphic function ¢ should be viewed 
as a C-valued continuous function in the right-hand side of (62) and where a(a”) 
denotes the extended spectrum of a”), 


The full strength of the definition given in the following remark will be needed 
when we discuss Theorem 7.29 below. This definition and the accompanying 
property is also used in the statement and the proof of Theorem 7.26 (and hence, 
of Theorem 7.27 as well). 


Remark 7.25. A possibly unbounded linear operator L : D(L) C H — H ona 
Hilbert space H, with domain D(L), is said to be invertible if it is a bijection from 
D(L) onto H and if its inverse, L~', is bounded. (If L is closed, an assumption 
which is satisfied by all of the operators considered in this section, then L~! is 
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automatically bounded, by the closed graph theorem [35, 67, 151].) Furthermore, 
essentially by definition of the spectrum, L is invertible if and only if 0 € o(L). 
(See, e.g., [67, 147, 152, 158].) 


We can now state the counterparts of Theorems 6.14 and 6.15 (the key results 
from [87] discussed in Sect. 6 above) in this context. (See Remark 7.28 below.) 


Theorem 7.26 (Analog of Theorem 6.14 [50, 53]).. Given any c € R, the spectral 
operator d = d¢ is quasi-invertible if and only if ¢ = ¢(s) does not have any 
zeros along the vertical line {Re(s) = c}; i.e., if and only the “c-partial Riemann 
hypothesis” is true. 


Hence, much as in Corollary 6.16, a,/2 is not quasi-invertible (since ¢ has at 
least one zero along the critical line {Re(s) = 1/2}).'° Furthermore, a, is quasi- 
invertible for every c > 1. Actually, for c > 1, a = a, is invertible (which implies 
that it is quasi-invertible) and its inverse is given by a~! = IIe g(l — p~°), with 
the convergence of the infinite product holding in @(H_,); see [52, 53]. 

Just as Theorem 6.15 is a consequence of Theorem 6.14, the following key result 
follows from Theorem 7.26. 


Theorem 7.27 (Analog of Theorem 6.15 [50,53]).. The spectral operator is quasi- 
invertible for every value of c in (0,1) other than in the midfractal case when c = 
1/2 (or, equivalently, for every c € (0, 1/2)) if and only if the Riemann hypothesis 
is true. 


Remark 7.28. Recall from Sect.6 that Theorems 6.14 and 6.15 are expressed in 
terms of the solvability of the inverse spectral problem (ISP)p for all fractal strings 
of Minkowski dimension D. Here, the role played by the dimension D is now played 
by the parameter c, while the solvability of (ISP)p is now replaced by the quasi- 
invertibility of a,. In a precise sense, c is the analog of D in the present context; this 
analogy is based in part on some of the results of [94] which are briefly discussed in 
Sect. 3 above (see Theorem 3.3 and the comments following it), provided one thinks 
of the functions f = f(t) as geometric counting functions of fractal strings (modulo 
the change of variable x = e’). 


We leave it to the interested reader to state (in the present context) the counterpart 
of Theorem 6.17 (about a mathematical phase transition at c = 1/2). 

Like Theorem 6.15, Theorem 7.27 is a symmetric criterion for the Riemann 
hypothesis (RH). Indeed, in light of the functional equation (28) for ¢ connecting 
€(s) and ¢(1—s), we can replace the open interval (0, 1/2) by the symmetric interval 
(1/2, 1), with respect to 1/2. (See also Eq. (29) above.) In contrast, the author has 
recently discovered the following asymmetric criterion for RH, as we now explain. 
(See [80].) 


'3Jn fact, ¢ has infinitely many zeros along the critical line, according to Hardy’s theorem (see 
[29, 174]) but this is irrelevant here. We refer to [50, 53] for a version of Theorem 7.26 for which 
this well-known fact actually matters. 
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Let b = b, be the nonnegative self-adjoint operator defined by b := aa* = a*a. 
Note that 6 is a nonnegative self-adjoint operator because a is normal (see, e.g., 
[67]) and, like a itself, is unbounded for all c € (0,1), while it is bounded for all 
c > | (see [49, 50, 53]). Then, 6 is invertible (in the sense of possibly unbounded 
operators, see Remark 7.25 above) if and only if 6 is bounded away from zero (i.e., 
if and only if there exists y > 0 such that (6f,f). > y|lf|[2, for all f € D(b)). 
Furthermore, 6 is invertible if and only if a is invertible, which is the case if and 
only 0 ¢ o(a) or, equivalently, 0 ¢ o(6). 


Theorem 7.29 (An Asymmetric Reformulation of the Riemann Hypothesis 
[80]). The following statements are equivalent: 


(i) The Riemann hypothesis is true. 

(ii) The spectral operator a is invertible for all c € (0, 1/2). 
(iii) The self-adjoint operator 6 is invertible for all c € (0, 1/2). 
(iv) For eachc € (0, 1/2), 6 is bounded away from zero. 


We know that in part (iv) of Theorem 7.29, the implicit lower bound y = y, may 
vary with c and even tend to 0 as c > (1/2)7 orc > OF. 

A priori, the phase transition observed at c = 1/2 in Theorem 7.29 just above 
is of a very different nature from the one observed in Theorem 6.15 (based on [87], 
see also Theorem 6.17) or in Theorem 7.27 (based on [50, 53], see the comment 
following Remark 7.28). Indeed, under RH (and, in fact, if and only RH holds, by 
Theorem 7.29), we have that, for all c € (0,1/2), o(a) is a closed unbounded 
subset of C not containing 0 and hence, a is invertible. This follows from the 
expression (58) obtained in [49, 50, 53] for o(a), combined with a conditional result 
(by GarunkStis and Steuding; see the proof of Lemma 4 and Proposition 5 in [37]) 
about the non-universality of ¢ in the left critical strip {0 < Re(s) < 1/2}. 

On the other hand, for any c € (1/2,1), we have that o(a) = C and hence, 
that a is not invertible. This follows again from (58), but now combined with the 
Bohr—Courant theorem [11] according to which for every c € (1/2, 1), the range of 
é along any vertical line {Re(s) = c} is dense in C. This latter fact is a consequence 
of the universality of ¢ in the right critical strip {1/2 < Re(s) < 1}, as was first 
observed and established by Voronin in [177, 178]. 

Consequently, the (unconditional) universality of the Riemann zeta function ¢ in 
the right critical strip {1/2 < Re(s) < 1} and the (conditional) non-universality 
of ¢ in the left critical strip {0 < Re(s) < 1/2} are absolutely crucial in order to 
understand the mathematical phase transition occurring in the midfractal case when 
c = 1/2, according to Theorem 7.29, if and only RH holds. 

We stress that the universality of ¢, initially discovered by Voronin in [178] in 
the mid-1970s after he extended the Bohr—Courant theorem to jets of ¢ (consisting 
of ¢ and its derivatives), in [177], has since been generalized in a number of ways; 
see, e.g., the books [66, 117, 171], along with [2, 3, 51, 53, 148, 149, 170] and the 
many relevant references therein. 

Roughly speaking, the universality theorem, as generalized in [2, 3, 148], states 
that given any compact subset K of the right critical strip {1/2 < Re(s) < l} 
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with connected complement in C, every C-valued continuous function on K which 
is holomorphic and nowhere vanishing on the interior of K can be uniformly 
approximated on K by vertical translates of ¢. 

The universality theorem has been extended to a large class of L-functions for 
which the GRH is expected to hold; see, e.g., [171] for an exposition, along with 
[170] and the relevant appendices of [51, 53]. Accordingly, Theorem 7.29 can be 
extended to many L-functions. Similarly, but for different and simpler reasons, after 
a suitable modification in the definition of the truncated infinitesimal shift a, 
Theorems 7.26 and 7.27 (based on [49-51, 53]) can be extended to essentially 
all of the Z-functions (or arithmetic zeta functions) for which GRH is expected to 
hold (independently of whether or not the analog of the universality theorem holds 
for those zeta functions); see, e.g., [131, 132, 157], [101, Appendix A] and [79, 
Appendices B,C,E]. 

Moreover, we point out that a counterpart in the present context [that is, for 
the spectral operator a = ¢€(0)] of the universality of ¢ is provided in [51, 53]. 
Interestingly, the corresponding “quantized universality” then involves the family 
of truncated spectral operators {a = ¢(0)}7.0; that is, the complex variable s 
is not replaced by the operator 0, as one might naively expect, but by the family 
of truncated infinitesimal shifts {07.9 discussed earlier in this section; see [51] 
and [53]. 

In addition, we note that the (conditional) phase transition at c = 1/2 associated 
with Theorem 7.29 (and also with Theorem 6.15, from [86, 87], which preceded 
the work in [14, 15]; see also its interpretation given in Theorem 6.17, from [75, 
76]) is of a very different nature from the one studied by Bost and Connes in [14, 
15]. (See also [22, §V.11].) Indeed, the latter phase transition has nothing to do 
with the universality or with the critical zeros of ¢ but instead, is merely connected 
with the pole of C(s) at s = 1.'4 However, it is natural to wonder whether the 
conditional phase transition occurring at c = 1/2 in Theorem 7.29 (if and only if 
RH holds) can be interpreted physically and mathematically (as in [14, 15]) as some 
kind of “symmetry breaking” associated with a suitable physical model (in quantum 
statistical physics or quantum field theory, for example) as well as corresponding to 
a change in the nature of an appropriate symmetry group yet to be attached to the 
present situation. 


Remark 7.30. tis also natural to wonder what is the inverse of the spectral operator, 
when 0 < c < 1/2 and assuming that RH holds. Naturally, according to the 
functional calculus, we must have a~! = (1/€)(d), even if a~! is not bounded (i.e., 
even if D(a~') # H, or equivalently, if R(a) 4 H,, where R(a) denotes the range of 
a). We conjecture that under RH, we have for all c € (0, 1/2) and all f € D(a!) = 


H., a l(f) = ©, w(n)n~*(f), so that a!) = °°, w(n)f(t — logn), both 


'4 4 similar phase transition (or “symmetry breaking”) is observed at c = 1, as is discussed in [49- 
51, 53], since a is bounded and invertible for c > 1, unbounded and not invertible for 1/2 < c < 1, 
while, likewise, o(a) is a compact subset of C not containing the origin for c > 1 and o(a) = Cis 
unbounded and contains the origin if 1/2 <c < 1. 
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for a.e. t € R and as an identity between functions in H,. (Here, 44 = ju(n) denotes 
the classic Mébius function, given by (mn) = 1 or —1, respectively, ifn > 2 isa 
square-free integer that is a product of an even or odd number of distinct primes, 
respectively, and j4(n) = 0 otherwise; see, e.g., [29] or [174].) Observe that if that 
were the case, then “quantized Dirichlet series” would behave very differently from 
ordinary (complex-valued) Dirichlet series. In particular, the quantized Riemann 
zeta function a = ¢(0) would be rather different from its classic counterpart, the 
ordinary Riemann zeta function, = €(s) (where s € C). Indeed, as is well known, 
even if the Riemann hypothesis is true, the series )°°°, w(n) n~* cannot converge 
for any value of s := so with 0 < Re(so) < 1/2. Otherwise, it would follow from 
the standard properties of (numerical) Dirichlet series and the Mobius inversion 
formula (see, e.g., [29]) that the sum of the series )°°2 , 4(n)n~* would have to 
be holomorphic and to coincide with (1/C)(s) in the half-plane {Re(s) > Re(so)}, 
which is, of course, impossible because the meromorphic function 1/¢ must have 
a pole at every zero of ¢ along the critical line {Re(s) = 1/2} (of which there are 
infinitely many). 


We close this discussion by a final comment related to the operator 6 (appearing in 
the statement of parts (iii) and (iv) of Theorem 7.29) and to the possible origin of 
the (conditional) phase transition occurring in the midfractal case c = 1/2. 


Remark 7.31. The nonnegative self-adjoint operator 6 = aa* = a*a is given by 
the following formula: 


H(A) = £20) YF (r= tog), (63) 


(k,n)=1 


where the sum is taken over all pairs of integers k,n > 1 without common factor. 
The appearance of the terms n~*° (for n > 1) and of the factor (2c) are very 
interesting features of this formula (which first appeared in [100, 101, §6.3.2] 
without formal justification). A priori, the above formula was derived (in [52, 53], 
motivated in part by the new result of [80] described in Theorem 7.29 above) by 
assuming that c > 1. However, the series }°°° , n~*¢ is convergent for c > 1/2 and 
the term ¢(2c) is singular only at c = 1/2, due to the pole of f(s) at s = 1. This 
may shed new light on the origin of the phase transition occurring at c = 1/2 in the 
statement of Theorem 7.29. 

Given the reformulation of the Riemann hypothesis provided above in terms of 
the invertibility of the spectral operator b = b,, where b = a*a = aa* (see 
the equivalence of (1), (iii), and (iv) in Theorem 7.29), it is tempting to extend 
formula (63) to a suitable class @ of “test functions” f = f(t) (technically, a 
“core” for the operator b, that is, a dense subspace of D(6) in the Hilbert graph 
norm ||{f||le = (IFI|? + [|2f1|2)!/7) so that it becomes valid for all c € (0, 1/2). 
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More specifically, for each c € (0, 1/2), we would like to show that there exists a 
constant y = y, > 0 (which may depend on the parameter c) such that 


lOflle = vilflle. (64) 


for all f € @. (Compare with part (iv) of Theorem 7.29 above as well as with the 
comments preceding the statement of Theorem 7.29.) 

In light of the new asymmetric criterion for RH obtained in [80] (see Theo- 
rem 7.29 and, especially, the equivalence of (i) and (iv) in Theorem 7.29), this 
conjectured inequality (64) would imply (in fact, would be equivalent to) the 
Riemann hypothesis. Thus far, however, the author has only been able to verify 
it for a certain class of test functions, unfortunately not yet large enough to fulfill 
the required conditions. 


A last comment is in order. 


Remark 7.32. Tn two works in preparation [81, 82], the author explores some of the 
applications of this new formalism (“quantized number theory” from [49—53, 80]) to 
potentially reformulating the Weyl conjectures (for curves and higher-dimensional 
varieties over finite fields or, equivalently, for function fields),'> as well as for 
constructing generalized Polya—Hilbert operators!° with spectra the Riemann zeros 
or, in fact, the zeros (and the poles) of general L-functions. For this purpose, 
Lapidus has introduced a related but different formalism, in [81, 82] associated 
with harmonic analysis and operator theory in weighted Bergman spaces of analytic 
functions (see, e.g., [1, 48]).!7 On the one hand, this new functional analytic 
framework offers greater flexibility and ease of use, since it only involves bounded 
operators, albeit of an unusual nature. On the other hand, the framework (from [49- 
53, 80]) discussed in this section presents the significant advantage, in particular, of 
naturally parametrizing the critical strip 0 < Re(s) < 1 by means of the dimension 
parameter c € (0,1). Only future research on both approaches will help us to 
eventually determine whether one formalism should be preferred to the other one, 
or as is more likely to happen, whether both formalisms should be used in order to 
further develop different aspects of quantized number theory. 
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Sums of Two Squares in Short Intervals 


James Maynard 


Dedicated to Helmut Maier on the occasion of his 60th birthday 
Abstract We show that there are short intervals [x,x + y] containing >> y!/!° 
numbers expressible as the sum of two squares, which is many more than the average 
when y = o((logx)*/°). We obtain similar results for sums of two squares in short 
arithmetic progressions. 


1 Introduction 


Let X be large and y € (0, X]. The prime number theorem shows that on average for 
x € [X, 2X] we have 


a(x + y)— (x) & (1) 


logx’ 
The approximation (1) is known ([{10], improving on [12]) to hold in the sense of 
asymptotic equivalence for all x € [X,2X] provided that X’/!? < y < X, whilst 
the celebrated result of Maier [13] shows that (1) does not hold in the sense of 


asymptotic equivalence for all x € [X, 2X] when y is of size (log X)“, for any fixed 
A > 0. He showed that for y = (log X)“, there exists x € [X, 2X] such that 


(x+y) — x(x) > (1+ ee (2) 


for a positive constant 6, depending only on A (and he also obtained similar results 
for intervals containing fewer than the average number of primes). 
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It is trivial that in the range y < log X, (1) cannot hold in the sense of asymptotic 
equivalence, and the extent to which this approximation fails is closely related to the 
study of gaps between primes. Maier and Stewart [15] combined the Erdés—Rankin 
construction for large gaps between primes with the work of Maier to get show 
the existence of intervals containing significantly fewer primes than the average. It 
follows from the recent advances in the study of small gaps between primes ([16, 
18, 21] and unpublished work of Tao) that there are also short intervals containing 
significantly more primes than the average. 

The situation is similar for the set .” of integers representable by the sum of two 
squares. Hooley [11] has shown that for almost all x € [X, 2X] one has 


Gy Gy 
== K Yo 1K 5. (3) 
log . ne SN [x,x+y] log . 


provided y/./log X — oo (here G > 0 is an absolute constant so that the above 
sum is (1 + o(1))Gy/./log X on average over x € [X, 2X]). 

By adapting Maier’s method, Balog and Wooley [1] showed that for any fixed 
A > Oand y = (log X)", there exists 6’, > 0 and an x € [X, 2X] such that 


Gy 
yY > 12> (+64), 
ne SN [x,x+y] log x 


(4) 


and so Hooley’s result cannot be strengthened to an asymptotic which holds for all 
x € [X, 2X] when y is of size (log x)*. 

As a result of the “GPY sieve”, Graham, Goldston, Pintz and Yildirim [6] have 
shown that for any fixed m there exists x € [X, 2X] such that [x, x + O,,(1)] contains 
m numbers which are the product of two primes = 1 (mod 4), and so m elements of 
-/. By modifying the GPY sieve to find short intervals containing many elements 
of .Y, we show the existence of intervals [x,x + y] containing a higher order of 
magnitude than the average number when y is of size o((log X)°/°). Thus in this 
range we see that the upper bound in Hooley’s result cannot hold for all x € [X, 2X]. 
(It is a classical result that the lower bound cannot hold for all x in such a range; the 
strongest such result is due to Richards [20] who has shown there exists x € [X, 2X] 
such that there are no integers n € .¥ in an interval [x, x + O(log x)].) 


2 Intervals with Many Primes or Sums of Two Squares 


Let the function g : (0,00) — R be defined by 


g(t) = supe” (u), (5) 
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where y is the Euler constant and w(u) is the Buchstab function.' Clearly g is 
decreasing, and it is known that g(t) > 1 for allt > 0. 

We first recall the current state of knowledge regarding short intervals and short 
arithmetic progressions containing unusually many primes 


Theorem 2.1. Fix « > 0. There is a constant co(€) > 0 such that we have the 
following: 


1. Let X,y satisfy co(€) < y < X. Then there exist at least X'!~'/'8!°8* values 
x € [X, 2X] such that 


logy one) y 


m(x + y)— ma) 2 Cee y logx’ 


2. Fix a € N, and let Q,x satisfy co(€) < Q < x/2. Then there exist at least 
Q!-l/leslos@ values q € [Q,2Q] such that 


logx/q ) ae gee Seee)) x 
log log x x/q $(q) logx” 


3. Let q,x satisfy co(€) < q < x/2, and let q have no prime factors less than 
log. x/ log log.x. Then there exist at least q'~'/'°8'£4 integers a € [1, q] such that 


rlasa.0) > (e( 


logx/q ) aga gueeMn)) x 


mea.) = (e(cp tops x/q 6(q)logx’ 


Here c > 0 is an absolute constant. 


We do not claim Theorem 2.1 is new; all the statements of Theorem 2.1 follow 
immediately from the work [5, 13] and [16]. 

When the number of integers in the short interval or arithmetic progression is 
large compared with log x log log x, the first terms in parentheses dominate the right- 
hand sides, and we obtain the results of Maier [13] and Friedlander and Granville [5] 
that there are many intervals and arithmetic progressions which contain more than 
the average number of primes by a constant factor. When the number of integers is 
small compared with log x log log x, the final terms dominate, and we see that there 
are many intervals and arithmetic progressions which contain a number of primes 
which is a higher order of magnitude than the average number. 

Part 3 of Theorem 2.1 is one of the key ingredients in recent work of the 
author on large gaps between primes [17]. The previous best lower bound for the 
largest gaps between consecutive primes bounded by x was due to Pintz [19] who 
explicitly indicated that one obstruction to improving his work was the inability of 
previous techniques to show the existence of many primes in such a short arithmetic 


'The Buchstab function w(w) is defined by the delay-differential equation 


o(u) =u! (0<u<2), > (wow) =o(u-—1) (u>2). 
u 
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progression with prime modulus (this issue was also clear in the key work of 
Maier and Pomerance [14] on this problem). The independent improvement on this 
bound by Ford, Green, Konyagin and Tao [3] used the Green—Tao technology to 
show there are many primes g for which there were many primes in an arithmetic 
progression modulo gq, although it appears this approach does not give improvements 
to Theorem 2.1. 

The aim of this paper is to establish an analogous statement to Theorem 2.1 for 
the set .Y of numbers representable as the sum of two squares. We first require some 
notation. We define the counting functions S(x) and S(x; q, a), the constant G and 
the multiplicative function 6.7 by 


S(x) = #{n<x:ne JF}, (6) 
SQ; q,a) = #{n <x:ne€.Y%,n =a (mod g)}, (7) 
1 1\-1/2 
6=— |] (Q@-s) . (8) 
V2 23 (mod | P ) 
p, p =1 (mod 4), 
‘ etl /(p +1), p=3 (mod 4), 
a= Cae ), p=3( ) (0) 
2, p= 2ande => 2, 
2, po =2. 


With this notation, the average number S(x + y) — S(x) of elements of .7 in a short 
interval [x,x + y] is ~ Gy/./log x and the average number S(x; q, a) of elements of 
Y in the arithmetic progression a (mod gq) for (a,q) = 1 anda = 1 (mod (4, q)) 
which are less than x is ~ Gx/(¢.¥(q)/log x). 

Finally, we let F be the sieve function occurring in the upper bound half 
dimensional sieve, that is the function defined by the delay-differential equations 


1 
F(s) = forO<s <2, (s\/?F(s)) = oo 1) for s > 2, 


e 
q1/251/2 
1 
f(s) =O0for0<s <1, (s!/2F(s))’ = 58 OF(s —1)fors>1. 
(10) 
It is known that 1 < F(s) < 1 + O(e~*) for all s > 0. 


Theorem 2.2. Fix € > 0, and let Y denote the set of integers representable as the 
sum of two squares. There is a constant co(€) > 0 such that we have the following: 


1. Let X,y satisfy co(€) < y < X. Then there exists at least X'!~'/!8!8* values 
x € [X, 2X] such that 


logy ye =) Gy 


S(@@+y)-S@) = (Ge os ne ete 
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2. Fixa € YY, and let Q,x satisfy co(e) < Q < x/2. Then there exists at least 
Q!-l/leslos 2 values q € [Q,2Q] such that 


log x log x)!/? Gx 
Steg (H( 2 cede) | 
log log x (x/q)?!"°/ by (q)(ogx)'/? 
3. Let q,x satisfy co(€) < q < x/2, and let q have no prime factors less than 
log x/loglogx. Then there exists at least q\~'/2'°84 integers a € [1,q] such 
that 


a) —— Gx 
—e€ 


Sina) = (Fog: C/N) 67 @Mlogay? 


Here c > O is an absolute constant. 


Theorem 2.2 is our equivalent of Theorem 2.1 for numbers expressible as the sum 
of two squares. When the number of integers in the short interval or arithmetic 
progression is large compared with (logx)*/°, we find that there more than the 
average number of integers representable as the sum of two square, which is a result 
of Balog and Wooley [1], based on Maier’s method. When the number of integers 
is small compared with (log x)°/°, then the number of integers representable as the 
sum of two squares is of a higher order of magnitude than the average. We note that 
unlike Theorem 2.1, Theorem 2.2 improves on the Maier matrix bounds in a range 
beyond the trivial region by a full positive power of log x. 

The constant 9/10 appearing in the exponent in the denominators of the final 
terms of Theorem 2.2 is certainly not optimal; with slightly more effort, one could 
certainly improve this. We satisfy ourselves with a weaker bound for simplicity here, 
the key point of interest being one can obtain a constant less than |, and so there are 
intervals of length considerably larger than the average gap between elements of .” 
which contain a higher order of magnitude than the average number of elements. 

Analogously to the work of Granville and Soundararajan [8], we believe the 
phenomenon of significant failures of equidistribution in short intervals should hold 
in rather more general “arithmetic sequences” which are strongly equidistributed 
in arithmetic progressions (in the sense of a weak Bombieri—Vinogradov type 
theorem). We hope to return to this in a future paper. 


3 Overview 


We give a rough overview of the methods, emphasizing the similarities between 
Maier’s matrix method and the GPY method. For simplicity we concentrate on the 
case when we are looking for primes in short intervals; the arguments for arithmetic 
progressions are similar, and when looking for sums of two squares one wishes to 
“sieve out” only primes congruent to 3 (mod 4). 
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We estimate a weighted average of the number of primes in a short interval, and 
wish to show that this weighted average is larger than what one would obtain if the 
primes were evenly distributed. More specifically, we consider the ratio 


( > af nine in lpn) /( > ex”) (11) 


X<n<2X X<n<2X 


where w,, are some non-negative weights chosen to be large when the interval [n, n+ 
h] contains many numbers with no small prime factors, and small otherwise. 

If the ratio (11) is greater than some constant co, then it is clear that there must 
be at least one n € [X, 2X] such that [n,n + h] contains at least coh/ log x primes 
(i.e. Co times the average number). Thus we aim to choose w,, to make this ratio as 
large as possible. 

In the range h = o(logX), we choose the weights w, to be concentrated on 
integers n such that the interval [n,n + h] only” contains numbers with no small 
prime factors.* To achieve this we choose w, to mimic weights of Selberg’s A?- 
sieve 


wn (Yo Attenth) (12) 


d;|n+i 


where Aq, ....q, are real numbers chosen later to optimize the final result. 

In the range h ~ (log x)* for some constant A > 1 it is unavoidable that a large 
number of elements of [n,n + h] will have small prime factors, and so we require a 
slightly different choice of the weights w,,. Instead we observe that if 7 is a multiple 
of primes less than z = (log x) (where 4’ < 1), then the probability that a random 
element of [n,n + h] is coprime to all these primes fluctuates depending on the size 
of 1/2’, and can be more than the average for general n. More specifically, we take 


(13) 
0, otherwise. 


ia " n = 0 (mod [],.,p), 
The number of elements of [n,n + h] which have no prime factors less than z is 
given by 


. : logh p-1 
j _ pl 
#4 €[1,h]: (j,p) = 1Vp <ai~e (er) : (14) 


p<z 


?We ignore the elements of [n,n + h] which are a multiple of some prime p < h, which is 
unavoidable. 


3Here a “small prime” refers to one bounded by roughly exp(h7!+°) log x). 
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for z > oo and z < h!~€ (here a is the Buchstab function). Thus, given h we can 
choose z so that the number of elements coprime to small primes is larger than the 
expected number in a random interval by a factor e” w(log h/ log z) > 1. 

Given this choice of weights, to estimate the numerator in (11) we rewrite the 
count of the primes as a sum of the indicator function of the primes, and swap the 
order of summation. This gives 


h 
> Yo It /wn. (15) 


j=l X<n<2X 


(Here, and throughout this paper, 1,v will denote the indicator function of a 
set &, in this case the set P of primes.) Our choice of w, means that we can 
estimate this expression using knowledge of the distribution of primes in arithmetic 
progressions. The inner sum vanishes or has a simple asymptotic expression 
depending on the arithmetic structure of j. In the case of the Maier weights, this leads 
us to (14), which gives our asymptotic for the numerator, and hence the ratio (11). 
In the range h = o(log X), we obtain a quadratic form in the coefficients Aq, a, 
for the numerator, and similarly a quadratic form for the denominator. We then 
make a choice of these Ay,.....4, Which (approximately) maximizes the ratio of these 
quadratic forms. 


Remark 3.1. One can heuristically investigate Selberg-sieve weights (12) for larger 
h, and (assuming various error terms are negligible) conclude that an approximately 
optimal choice of weights Ag,,.a, Should be given by Ag, * UW(di...dx) if 
[ls d; has all of its prime factors less than a small multiple of logx (and 0 
otherwise). This corresponds precisely to Maier’s choice of weights (13). 


4 Admissible Sets of Linear Functions 


The bounds involving the sieve function F in Theorem 2.2 follow from the argument 
of Balog and Wooley [1], and by making minor adaptions analogous to the work of 
Friedlander—Granville [5]. We sketch such arguments in Sect. 5. 

Therefore the main task of this paper is to establish the bounds coming from 
the final terms in parentheses of Theorem 2.2. We will do this by an adaptation 
of the GPY sieve method. Indeed, we will actually prove a rather stronger result, 
given by Theorem 4.3 below, which is roughly analogous to the main theorem of 
Maynard [16]. 


“One needs to be slightly careful about the possible effect of Siegel zeros here, but this is a minor 
technical issue. 
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To ease notation, we let (P,) denote the set of integers composed only of primes 
congruent to 1 (mod 4). Similarly let (P3) those composed of primes congruent to 
3 (mod 4). 


Definition 4.1 ((P3)-Admissibility). We say a set 2 = {L),...,L,} of distinct 
linear functions L;(n) = ajn + b; with integer coefficients is (P3)-admissible if for 
every prime p = 3 (mod 4) there is an integer n, such that ly Li(np),p) = 1, 
and if a;,b; > O for all i. 


Theorem 4.3. Let x be sufficiently large and k < (logx)!/°. Then there is a prime 
Po > loglogx such that the following holds. 

Let 2 = {Li,..., Lx} be a (P3)-admissible set of linear functions such that the 
coefficients of Li(n) = ain + b; satisfy 0 < a; < (logx)'/7, (2po,a;) = 1, and 
0 < b; < x. There exists an absolute constant C > 0 such that 


#{n € [x, 2x]: at least k'/?/C of Li(n),...,L,(n) are in SY} > xexp(—(logx)!””). 


We remark that the conditions k < (log X)!/°, 2+ aj, a; < (log X)!/3 and b; < x are 
considerably weaker than what our method requires, but simplify some of the later 
arguments slightly and are sufficient for our application. 

In the region where y, x/Q and x/q are small compared with (log x)*/°, the 
bounds in Theorem 2.2 follow easily from Theorem 4.3. For sums of squares in short 
intervals, we consider y < (log x)°/9+*. We take k = y!/>, and let Z = {Lj,..., Ly} 
be the (P3)-admissible set of linear functions L;(n) = n+ h;, where hy,..., hx 
are the first k positive integers in (P,) which are coprime with po. (This is (P3)- 
admissible, since L;(0) is coprime to all primes p = 3 (mod 4) for all i.) We note 
that hy, < k(logk)!/? < y. It then follows immediately from Theorem 4.3 that there 
are > X exp(—(log X)!/?) values of n € [X,2X] such that >> k!/? = y'/!° of the 
L(n) are in Y, and hence >> X!~!/!s!e2X values of x € [X, 2X] such that [x,x + y] 
contains at least y!/!° elements of 7. 

Parts 2 and 3 of Theorem 2.2 follow analogously, defining k = (x/Q)!/> and 
L(n) = a+ hin when Q > x(logx)~>/°~* for part 2, and defining k = (x/q)!/!° 
and Li(n) = n+ hig when q > x(logx)~*/°~€ for part 3. We note that it follows 
from Theorem 2.2 that actually there are many intervals [x,x + y] containing > 
y!/? (log y)!/4 elements of .Y (rather than >> y!/!°), for the smaller range 1 < y < 
(log x)!/°. 


5 Irregularities from the Maier Matrix Method 


As previously mentioned, the bounds in Theorem 2.2 when y, x/Q or x/q large 
compared with (log x)°/? follow from minor adaptions of the argument of Balog 
and Wooley [1]. In particular, the relevant bound for part 2 of Theorem 2.2 follows 
from the proof of [1, Theorem 1]. We give brief outline of the argument for parts 2 
and 3, leaving the details to the interested reader. 
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We first consider part 2 of Theorem 2.2. We assume that (log x)°/?-* < x/Q < 
(log X)°" since otherwise either the result follows from Theorem 4.3, or the result 
is trivial. For simplicity we shall assume that a is odd; the case for even a is entirely 
analogous. We let P = II,=3 (mod 4),p<cP"”, where a is the least odd integer such 
that p% > a(4x/Q + 1) and z = log x/(log log x)”. We note that P = Q°!/leg!08 2), 
Finally, we let 2 = {q € [11 —5)Q + 4a, Q + 4a] : g = 4a (mod 4P)} for some 
small fixed constant 6 = 6(€) > 0. We have 


Vi s@gaa> YL Yo1y@+rq) 


qeQ r<x/Q-1 qeQ 


? = 1y(a(4r + 1) + 4mrP). (16) 


r<x/Q—1 mé[(1—5)Q/4P,Q/4P] 


The inner sum is counting elements of .¥ in an arithmetic progression to modulus 
4rP. We note that by construction of P we must have (P, a) = e” for some e € (P3) 
since a € .Y, and similarly if the inner sum is non-empty we must have (P, a(4r + 
1)) = e?d? since a(4r + 1) + 4mrP € .Y. Moreover P/(e?d”) and P are composed 
of the same prime factors, since each prime occurs in P with odd multiplicity. Thus, 
if r is such that (P, 4r + 1) = d?, the inner sum is 


(1 + o(1)) Sree) ~ S80 (17) 
gy (4rP/(d2e?)) \/log rO/d2e2 26.7 (P)/Jlog x 
Therefore, letting ud” = 4r + 1, we obtain 
Y> S(x:q,a) = Se ee » (18) 


26(P)/logx 

on b7 (P)/logx a |P u<(4x/Q+1)/d? 

u=1 (mod 4) 
(u,P)=1 


This double sum is exactly the sum &* which is estimated in [1, Proof of 
Lemma 4.3]. In particular, they show that 


P logx 
> y- io oy( r( g -), (19) 
: . P logz 
a|P u<(4x/O+1)/d 
u=1 (mod 4) 


(u,P)=1 


Recalling that z = log x/(log log x)’, one arrives at (for 6 sufficiently small) 


. _ logx/q\ _ Gx x 
© (5 #0) —(F(cpiogs) —*) aoe > Pigs 


qeQ 


This implies the relevant bound in part 2 of Theorem 2.2. 
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We now consider part 3 of Theorem 2.2. We let P = II,=3 ied Aigo p®> where 


B, is the least odd integer such that p4» > 4x/q, and let of = {a € [(1 — 8)q,q] : 
a = do (mod P)}. We see that 


» S(x; q, a) > » ye 1Ly(a+ rq) 
aEadh r<(x—q)/q a€eW@ 


= » Ly(ao +rq+mP). (21) 
r<x/q-1 me[(—8)q—ao)/P,(q—ao)/P] 


We see the inner sum counts elements of .7 in an arithmetic progression modulo 
P. We choose ay = g/4 (mod P), so that (a9 + rqg,P) = (4r + 1,P) since, by 
assumption, g has no prime factors in common with P. Thus (4r + 1, P) = d? for 
some d € (P3) by the same argument as above, and we obtain 


(1 + 0(1))Gdq 
>" S42) => — => Yi. (22) 
aEead o(P) logx @|P u<(4x/q—3)/@ 
u=1 (mod 4) 


(u,P)=1 


Again, using (19), we obtain 


log x/q Gx x 
¥ (ve 20) ~ (Fl igiogs) —*) 55 TE) > Plog 


acd 


which implies the relevant bound for part 3. 


6 Setup for Theorem 4.3 


The improvements of the GPY sieve in the author’s work [16, 18] are not 
significant for the application of finding sums of two squares in short intervals (the 
improvement would be y®) in the final term for part 1 of Theorem 2.2, for example), 
and so we will use a uniform version of the simplest GPY sieve. 

In order to get a result that applies in the larger range, however, it is necessary 
to modify the sieve to the application; a uniform version of the argument in [6] (i.e. 
counting numbers with two prime factors both congruent to 1 (mod 4)) would only 
give non-trivial results in the region y < (log x)!/? log log x, for example. 

We follow a similar argument to [6] (and attempt to keep the notation similar), 
but make modifications to allow for uniformity (similar to those in [7] and [16], 
although it is simpler in this context) and to specialize so as to remove only primes 
congruent to 3 (mod 4) (so we will apply sieves of “dimension” k/2 and (k + 1)/2 
instead of dimension k and k + 1). 
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In order to state our setup, we require the following lemma. 


Lemma 6.1. Let x > 10 and « > 0. There exists a prime po € [(loglogx)/2, x] 
such that 


q(x 
¥ sup |s (x; g,a) — z@) Ke xexp(—co vlog x), 
gen (a,q)=1 $(q) 
" <x 


(q.po)=1 


for some absolute constant co > 0. 


Proof. This follows from known results on the repulsion of zeros of L-functions. 
Following [2, Chap. 28], we have (for a suitable constant c > 0) 
/ 1 (x’) 
po sup n@'sq,a)—= | <«K xexp(—c Vlog x) 
/2—e (4Q=1 (9) 
q<x / / 
(q.p0)=1 


+ logx yy sup ad Wie ol (24) 


q<exp(2c./log x) WSK X (mod q) (9) 
(q.po)=1 


However, by the Landau—Page theorem [2, Chap. 20], we have that if c is sufficiently 
small then 


$(g)1 S> |W’. K xexp(—3eVlogx) (25) 
x 


for all g < exp(2c./log x) and x’ < x, except possibly those which are a multiple 
of an exceptional modulus g;. Such an exceptional modulus must satisfy logx < 
qi(log qi)* « x, and must be square-free apart from a factor of at most 4. Taking 
Do to be the largest prime factor of gq; then gives the result. Oo 


Let Z = {Li,...,Lx} be the (P3)-admissible set of Theorem 4.3, po be the prime 
given by Lemma 6.1, and let 


w= [I 2 (26) 
p<2(log X)!/3 
p=3 (mod 4) 
PFPO 
It will be convenient to choose w, to be 0 unless n = vp (mod W), where vg 
is chosen such that (Lj(vo),W) = 1 for each j. (Such a vo exists by the (P3)- 
admissibility of {Z),...,Z,} and the Chinese Remainder Theorem.) This allows 


us to ignore n for which one of the L;(n) has a small prime factor congruent to 3 
(mod 4). 
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We then define our weights w, by 


2 
_ (Da Te, Lat) ha) , ifn = vo (mod W), 


Wn (27) 
0, otherwise, 
for some coefficients Ay (which we will choose explicitly later) satisfying 
ha =0 ifd> X'/" or p?(d) = Oord ¢ (P3) or (d,poW) # 1. (28) 


We recall that (P|) and (P3) denote the sets of integers composed only of prime 
factors p = 1 (mod 4) and p = 3 (mod 4), respectively. Finally, we define the 
function v = vy on primes by 


k 
vp) = {1 <n<p:] [Li =0 (mod p)}. (29) 


i=1 


We note that since {L,..., L,} is (P3)-admissible, v(p) < p for any prime p = 3 
(mod 4). 


7 Proof of Theorem 4.3 


Lemma 7.2. Assume the hypotheses of Theorem 4.3 and that sup,;|Aal < 
(log X)*/?, Then 


[] 2 + o(x'**). 


pide 


xX 
Ww, == Kare 
Ss wr 4 


X<n<2X 
n=vo (mod W) 


Proof. From the definition of w,, we have 


Yo w= Ade YD 1 (30) 
de 


X<n<2X l X<n<2X 
n=vo (mod W) n=vo (mod W) 


d,e| [Tay Lin) 


We concentrate on the inner sum. For each prime p,|de, we must have L;(n) = 0 
(mod p;) for some 1 < i < k, and so n can lie in one of v(p;) residue classes. 
Thus, using the Chinese remainder theorem, we can rewrite the inner sum as a sum 
of Tide v(p) sums of n ina single residue class (mod W{d, e]), where [d, e] is the 
least common multiple of d and e (since Ag = 0 if (d, W) 4 1 or d is not square- 
free). The number of integers X < n < 2X counted in any such residue class is then 
X/({d, e]W) + O(1). Putting this together gives 
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Yaad Yo l= Yur) (Fag + 0(1)). (31) 
de de ? 


X<n<2X p\de 
n=vo (mod W) 


del [Th Lin) 


We see that the main term gives the corresponding main term in the statement of the 
lemma. Since v(p) < k, the contribution from the error term is 


2 Thiak\2 
« sup Leal?( > I1é) <p a(x) ) 


d<X!/10 pid 


< sup |Aq|?X'/>(1 + log x)*. (32) 
d 


By assumption of the Lemma, k < (log X)!/? and sup, |Au| « (log X)‘/?, so this 
contribution is O(X!/5**), o 


Lemma 7.3. Assume the hypotheses of Theorem 4.3 and that supg|Aa|) < 
(log X)*/?, Then 


. Xk v(p)—1 
> at : L(n) € S\Wn > $(W) (log X)'72 dre] | 75 i 


X<n<2X p\de 
n=vo (mod W) 


+ O(X exp(—c / log X)) 


for some absolute constant c > 0. 


Proof. We first obtain a lower bound by only counting a subset of #{1 < i < 
k : L,(n) € } which will be more convenient for our manipulations (we lose 
an unimportant constant factor in doing this). Let Y’ C Y be given by 


SH! = {n € (P\) :n = mp for some m < X'? and some prime p}. (33) 


We obtain a lower bound by only counting elements of .7’. This gives 


Hi: Liln) € S\Wy = HI sick: Linle Hm 


X<n<2X X<n<2X 
n=vo (mod W) n=vo (mod W) 
k 2. 
= ~ lvGm( Yo ax). 64) 
j=l X<n<2x d| Te, L;(n) 


n=vo (mod W) 


By definition, if Lj(n) = ajn + b; € ’, then L;(n) can be written uniquely as 
mp for some m < X'/3 composed only of primes congruent to 1 (mod 4) and 
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with (m,a;) = 1, and some prime p = 1 (mod 4). Making this substitution and 
rearranging the sums, we are left to estimate 


> SO, aie Ye 1. (35) 


J=1 m<x'/3 de<x!/l0 (ajX+b;)/m<p<(2ajX +b;)/m 
m€(P) d,e€(P3) p=1 (mod 4 
(m,aj)=1 p=m(ajvo+b;) (mod ajW) 


[d.ell Tf) Li((ym—b;)/aj) 


For each prime p,|de with p; > (logx)!/3, there are v(p) — 1 possible primitive 
residue classes for p (mod p,) such that Mh, L;((mp — b;)/aj) = 0 (mod py). (All 
v(p) residue classes for which Mh, L;(n) = 0 (mod p;) correspond to a primitive 
residue class for p except for n = —ajb; (mod p;), and these all exist since aj < 
(log X)!/3 < p; and p; = 3 (mod 4) so p; + m.) Thus, by the Chinese remainder 
theorem, we can rewrite the inner sum as a sum of |,,,,.(v(p) — 1) different sums 
of primes in arithmetic progressions to modulus 4a;W WF deP (since 2 + aj). Thus 
the inner sum is 


ita) ((“ *) 7 (— *)) I] a 


—1 
pide P 
3aj;X . 
+ 0(E(—: 4aWid.el) T]). (36) 
pide 
where 
j 1(x’) 

E(x;q) = sup |7(x;q9,a) — : [d, e] = | [p. (37) 

(a.g=1 $(q) pide 

x <x 


We first consider the error term of (36). Letting r = 4a;W[d, e] « X '/4 (noting that 
(r, po) = 1) and Ajax = sup, |Aq|, this contributes at most 


a a > #(**; r) [3 


m<X!/3 pex!/4 pir 
(r.po)=1 


OE Do ee)" 


< ki 
m 7 
m<X!/3 r<4wx!/5 r<4wx!/5 
(r,po)=1 


1/2 1/2 1 
2 ' 2 9K é _ 2/3 man 
K Kx (ajX(9F + logx)™) (aX exp(—co log X7/3)) —. (38) 


m<X!/3 
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Here we have used Cauchy—Schwarz and the trivial bound E(X,q) « X/qforg < X 
in the first line, and Lemma 6.1 in the second. Thus, since k < (log X)!/>, qs 
(log X)!/3 and Amax < (log X)/? by assumption of the lemma, the contribution 
from (38) is O(X exp(—¢ J/log X)). 

We now consider the contribution from the main term of (36). The main term 
factorizes, simplifying to 


iawn I] za) » Tet (DA ioe 2) (39) 


pia; ptwP m<X!/3 m pide 
mé€(P1) 
(m,aj)=1 


The second term is straightforward to evaluate. For t > X* we have that 


-1 
> 1 > t(logt)!/” I] dale (40) 
om ae: 
me (Pi) meg 4) 


(m,qj)=1 


and so by partial summation we see that 


X+o0 -1 

ar > X(logx) 7 || — (41) 
m<Xx!/3 ue 8 in Plaj ss 
mé€(P1) p=! (mod 4) 


(m,aj)=1 
Noting that the products over prime divisors of a; cancel, this completes the proof 
of the lemma. Oo 


Lemma 7.4. Fix A > 0. Let 1/A < k < (logR)!? and let f : [0,1] > R bea 
smooth non-negative function. Let y(p) satisfy 0 < y(p) < min(Ak, (1 —1/A)p) for 
all p < R, and 


] 
w<p<z 
forall2 <w<z<R. Then 


(p) r 
ye T= : a (Er) 


r<R p\r 


(log R)* i = KL + x? loglogR 
= tt" 1 -( ——___-_- 
Sy (k) Jo FO ar( - Ons( logR )). 
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where 


6, = 710-2) "(-4). 


DER P P 
Proof. For f = 1, this is a version of [9, Lemma 5.4], with the dependence on k 


kept explicit. In particular, this is obtained in the proof of [4, Lemma 5.1]. The case 
of f 4 1 then follows immediately by partial summation. Oo 


Lemma 7.5. Let k < (log X)!/°, and let Aq be defined by 
P v(p) 
da = HA] | — + A eremeperoce 


for (d, poW) = 1, where 


u(r, ifr € (Ps), (, Wpo) = 1, 7 < X'/9, 
yr 
0, otherwise. 


Then 


S > Aade | | ——— *0)— 2% (log xy?) Yi Aare] | eS 


d,e p\de de p\de 


Proof. We first note that the y, variables diagonalize the second quadratic form. 
Substituting the expression for A, in terms of y, given by then lemma, we obtain 


(p) (p) 2! 
eal a = Dee (UI; ~ ay 
x mouter(T] voy (TI Tote a) (42) 


d,e:d|r,e|s 


The sum over d, e vanishes unless r = s, and so we find that 
v(p) v(p) 
dare] |—- = Ure (43) 
de p\de - r es — v(p) 


We now introduce new variables y* to perform an analogous diagonalization for the 
first quadratic form. Let y* be zero unless r € (P3), (r, Wpo) = 1, r < X!/!° and 
v(p) > 1 for all p|r. In this case, let y* be given by 


= Hol] ae DM =), (44) 


dir|d p\d 
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We see from this that for any d satisfying the same conditions, we have 


p(dra = ( ! 7) > 2-(T] a) Yeo 
‘ 


e:dle ple P r:d|r,rle 
= pe! « v(p) - 1 
= a) 215-9). 
pid rd|r pir 


This gives Aq in terms of y*. Performing the analogous computation to (43) with y* 
in place of y,, we obtain 


sal “ = Dev] ere , (46) 


de p\de pir 


Substituting the expression for Aq in terms of y, from the statement of the lemma 
in (44) gives (for r such that y* 4 0) 


; ae () (() 
f= HOT ai) Loe) 2," RLU bron ) 


sir|s pis d:r|d, d|s 


=ur>> AE y (47) 


sir|s 


We can now use Lemma 7.4 and definition of y, from the statement of the lemma 
to evaluate this sum. We take y;(p) = 1 if p = 3 (mod 4) and (p, rppW) = 1, and 
take y;(p) = 0 otherwise. We see that for any 2 < w < z < X'/'° we have 


1 logp yi(p)logp 1 
= a +O1< > = ee 1. (48) 


P\rpoW w<z<X!/10 


The sum on the left hand side is O(log log X). Thus, by applying Lemma 7.4 to the 
sum (47), we have for squarefree r < X!/!° with r € (P3) and (r, Wpo) = 1 and 


[],-v@) — 1) > 0 that 


cs 2 vi(P) 
G62 P Olle ew 


t<X1/10/y p\t 


XW 3 log log X (~—)\-17, 13 
. OTGID ( (1 7) + o(SE2E)) ara, Cae) 
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Since r, W € (P3) and ¢(po)/po = 1 + o(]), the product simplifies to give 


r 1\ x@)/2 
Ls) , (50) 


()- ) 
colts yG-,) =a+ 0 


where y is the non-trivial character modulo 4 


We can now evaluate both of the expressions (43) and (46) using Lemma 7.4. 
For (43), we take y(p) = v(p) if p = 3 (mod 4) and (p, Wpo) = 1, and y2(p) = 0 
otherwise. We see that 

k 


logp yo(p)logp =k 
5 > O02 >) > ~ 


PlpoW [Ti4j(aibj—bja) 


log <1. (51) 
2 w 


w<p<z P 
Thus, we obtain (with k = k/2 and L « klog log X) 


yyy —2 


r<X1/10 pir aa vp) 


_ (log Se 


kK log log X 
_ (i o( og log 
r( +k/2) 


Cree) TT 2M) wp 
[ 


Similarly, for (46) we take y3(p) = (v(p) — 1)p/(p — 1) for p = 3 (mod 4) and 


(p, Wpo) = 1, and take y3(p) = 0 otherwise. We obtain (with k = (k — 1)/2 and 
L < kloglogX) 


1/10 
O(log * 


r<X1/10 


y3(p) 

O(log log X ee 

POE ) P— y3(P) 
_ (log X!/10) S* 


kK logl 3(P)\- 1 = 
7 reer ayy + (age) 0-8 ) (ie ) , 


(53) 


We note that if y3(p) 4 0, then 1 — y3(p)/p = (1 — y2(p)/p)(1 — 1/p)~'. Thus 
since W € (P3) and $(po)/po = 1 + 0(1), we have 


y3(p)\—! 1\ ‘> W y2(p)\-! 1\ = 
1- —— 1-—- ~ — 1- 1-- : 34 
TI D ) ( a) o(W) TI Dp ) ( 2 i) 
Putting the estimates (49), (50), (52)-(54) together, we obtain 
vy)y—-1 owrd + k/2)(log X1/1)1/2 1\ x@)/2 
0; ” II; —vp)—- WFB/2P? (kK + 3)/2) ITC: ~ 5) 


p<X P 
- x TI v(p) 


55 
, =O (55) 
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Stirling’s formula shows that (x) ~ V2mxx‘e™, and so for k sufficiently large 
(1 +k/2)/P((k + 3)/2) >> k-'/?. Since the product over primes is convergent as 
X — ©, this gives 


Lor PT > ee” DAT. 60 
p\r r 


pir 


Finally, from (52) and k < (log X)!/°, it follows that we have the crude bound that 
this is > 1. Oo 


Lemma 7.6. Let k < (log X)!/> and Aq be as given by Lemma 7.5. Then 
|Aal « (log X)*/?. 
Proof. This is an immediate application of Lemma 7.4. We take y4(p) = v(p) if 


p = 3 (mod 4) and (p,dWpo) = 1, and y4(p) = 0 otherwise. By Lemma 7.4 
(taking k = k/2 and L < klog log X) we have 


P 
|Aal| = (I] ae) 2 eg ore) ry 
Pp 


r<x'/l0/q 
(r,dWpo)=1 
r€(P3) 
(log X)‘/? vp)! yalp)\~17, _ 1y¥/? 
. rosa l ll) Lie Pp ) i=) 


(log X)*/? kyr! 1\ 4/2 1\ 4/2 

«=~ Tl (--) (--) I] (@--). 
rd+k/2 

( * / one P (log X)!/3 <p<x 

p=3 (mod 4) p=! (mod 4) 


Recalling that k < (log X)!/°, we see that this is 


(log X)*/? k 14+0(—p'!) k 1+ 0(—p"!) 
TO +k/2) exp( be 5 ) 


(log X)!/3<p<x P (log X)!/3 <p<x P 
p=3 (mod 4) p=! (mod 4) 
« exp(O(k))k "7 (log X)/? « (log X)/?. (58) 


oO 


Proof (Proof of Theorem 4.3). By Lemmas 7.5 and 7.6, there is a choice of 
coefficients Aq such that |Ay| « (log X)*/? and 


ddete oor "y= — k-'? dog xt Ey Ail n~ 4, 


pide de pide 
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Thus, by Lemmas 7.2 and 7.3, this choice of Ay corresponds to a choice of w, > 0 
such that 


> HI <isk: Line hw, > ki? yw, > 


X<n<2X X<n<2X Wlogx 
n=vo (mod W) n=vo (mod W) 
(59) 
Lemma 7.6 shows |Ay| < (log X)/?, so we have 
2 
Wa X (log X)*( 1) (60) 


aT] Lim) 
Ag#O0 


Thus 


4 
~ wiKdosxy*® Yo (Tr@) Yo 1 


X<n<2X dy dy.d3,d4<X'/0 i=1 X<n<2X 
: k 
d dy ,d3.d4\ []j;= Li(n) 


4 
«xox (TJ) TT & 


d ,dz,d3,d4<X'/10 i=] p\d\dzd3d4 
15k2)X 2 
< X(log X)* y ice la <« X(logx)!* . (61) 
r 
r<X2/5 


By Cauchy—Schwarz, we have that 
#{n © [X, 2X]: H{L)(n),...,L,(n)} OS > ck'/?} 
2 -1 
> (« > (Hi : L,(n) € S}— ck?) ( » wa) (62) 
n€[X,2X] n€[X,2X] 


provided the sum being squared is positive. Therefore, by (59) and (61), we see that 
for c sufficiently small we have 


#{n © [X, 2X] : H{L)(n),...,L,(n)} OS > ckl/?} 


2 
> ane (nf: 00) « vn!) 


ne [X,2X] 
n=vo (mod W) 


> W2(log X) 10-42 > X exp(—(log X)!/”). (63) 


Here we have used the fact that k < (log X)!/> and W « exp((log X)?/°) in the last 
line. This completes the proof of Theorem 4.3. oO 
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Infinite Sumsets with Many Representations 


Melvyn B. Nathanson 


To Helmut Maier on his 60th birthday 


Abstract Let A be an infinite set of nonnegative integers. For h > 2, let hA be the 
set of all sums of / not necessarily distinct elements of A. Suppose that £ > 2, and 
that every sufficiently large integer in the sumset HA has at least ¢ representations. 
If £ = 2, then A(x) > (log x/ log h) — wo, where A(x) counts the number of integers 
a € A such that 1 < a < x. Lower bounds for A(x) are also obtained for £ > 3. 


1 Representation Functions of Sumsets 


Let A be a set of integers and let h > 2 be an integer. The counting function A(x) 
counts the number of positive integers in the set A that do not exceed x. The h-fold 
sumset AA is the set of all integers n that can be written as sums of h not necessarily 
distinct elements of A. For every integer n, the representation function rg ;,(n) counts 
the number of h-tuples (a), a2,..., dn) € A® such that 


ay <a) S++ <a 


and 
a) + az +++ + ay = 1. 


A Sidon set is a set A of nonnegative integers such that every element in the 
sumset 2A has a unique representation, that is, r42(n) = 1 for all n € 2A. More 
generally, for positive integers h and s, a By,s-set is a set A of nonnegative integers 
such that r4,,(n) < s for all n € hA. Sets whose sumsets have few representations 
have been studied intensively (cf. Halberstam—Roth [2], O’ Bryant [4]). 
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In this paper we consider sets whose h-fold sumsets have many representations. 
A basic result is that if h > 2 and A is an infinite set of nonnegative integers with 
ran(n) = 2 for all sufficiently large integers n € hA, then 


A(x) > logx. 
In the special case h = 2, Balasubramanian and Prakesh [1] proved that there is a 


number c > 0 such that, if A is an infinite set of nonnegative integers with rg .(n) > 
2 for all sufficiently large integers n € 2A, then 


l 2 
Ata) = ef ca ) 
log log x 


for x > xo. This improved the previous result of Nicolas et al. [3], who also proved 
the existence of an infinite set A of nonnegative integers with r42(n) > 2 for all 
sufficiently large integers n € 2A such that 


A(x) « (log x)’. 


It is an open problem to extend these results to h-fold sumsets for h > 3. 


2 Growth of Sets with Many Representations 


Let [u, v) denote the interval of integers i such that u < i < v. Let |X| denote the 
cardinality of the set X. 


Theorem 2.1. Let h > 2 be an integer, and let A be an infinite set of nonnegative 
integers. If ray,(n) = 2 for all sufficiently large integers n € HA, then there is a 
positive number ky = ko(A) such that 


logx 
A — —k 
OF logh e 


for all x = h. 


Proof. For every positive integer k, let 
— [ak nk) 
and 
Ay =ANK. 


The sets {A, :k = 1,2,...} partition A \ {0}. 
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There exists a positive integer mo such that, ifn > no andn € AA, then r4,(n) > 
2. Because A is infinite, there exists a9 € A with hag > no. Choose kp such that 
ag € Aly. 
Suppose that k > ko and Ay 4 @. Let 
a, = max(A,). 
Then 
eee ee 
and 


Anfap + 1h‘) =9. 


Consider the integer 


ha; € NA. 
Because af > do, we have ha; > hag = no, and so ra;,(haz) = 2. It follows that 
the set A contains nonnegative integers a1, ..., a, such that 
a, < ap (1) 
a Sa, 5°:' Sa (2) 
and 


a) +a) +--+ + ay = haj. 
Because A is a set of nonnegative integers, we have 
an < ha;. 
Inequalities (1) and (2) imply that 
hay = ay + ay + +++ + ay < hap 
and so 
a, < ay, < haz < et, 
Therefore, 


a+1 ce. 


IA 


an < 
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Equivalently, 
an € [ag + 1h") = [ag + 1h) U Aga. 
The maximality of af implies that AN lax +1, hk) = @, and so ay, € Ag+, 4 O. It 
follows by induction that A, 4 @ for all k > ko. 
Let x > h, and choose the positive integer f such that 


hi<x<pA't!, 


Because 
lo. 2) 
A\ {0} = Ax 
k=1 


it follows that 


logx 
logh 


A(x) = A(h') = t— (ko - 1) > — ko 


for all x => h. This completes the proof. 


Theorem 2.2. Let £ > 3 be an integer, and let A be an infinite set of nonnegative 
integers. If ran(n) = €& for all sufficiently large integers n € 2A, then there is a 
positive number k; = k,(A) such that 


£—1)1 
HG ss EN 
log 2 
for all x => 2. 
Proof. For every positive integer k, let 
i = [a ; 2) 
and 
Ap=AN k. 


The sets {A, :k = 1,2,...} partition A \ {0}. 

There exists a positive integer mg such that ifn > mp andn € 2A, then r42(n) > &. 
Because A is infinite, there exists dg) € A with 2ay9 > no. Choose kp such that 
ag € Ao. 

Suppose that k > ko and A, 4 @. Let 


a, = max(A;). 
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Then 
cae tea 
and 
AN [ag + 1,24) = 0. 
Consider the integer 
2a; € 2A. 


Because af > do, we have 2a; > 2a9 > no, and so r42(2a;) = £. It follows that 
the set A contains a subset 


{qj ii= 1,2 andj = 1,...,€-—1} 
such that, forj = 1,2,...,€— 1, we have 
A) < 4,2 <+0* < ayy < ay < ayy_| < +++ <ay2 < a) 
and 
aj + ar; = 2a;. 
Moreover, a; = 0 implies that 
a2j = 2a, 


forj = 1,2,...,€—1. 
Because 0 < aj; < ay j, we have 


* 
2a; =a j+aj < 2a; 
and so 
* * k+1 
ay < a2j < 2ay < 2°". 
Therefore, 


* k+1 
a, +1 aj <2 $ 


280 M.B. Nathanson 


Equivalently, 
an € [ag + 1,2°*1) = [ag + 1,2) UAga 
forj = 1,2,...,@—1. Because AN la; + 1, 2*) = @, we see that a2; € Ax+1 for 
j= 1,2,...,€—1, and so |Ax+1| => €— 1. It follows by induction that |A;| > €—1 
for allk > kyo + 1. 
Let x > 2, and choose the positive integer f such that 


W<x<2t!, 


Because 
lo ) 
A\ {0} = Ax 
k=1 


it follows that 
A(x) = A(2') = (€— 1I)¢— ko) 


> (= 1) (PS -h0-1) 


— (-llogx _ 
= log 2 ky 


where k; = (€ — 1)(ko + 1). This completes the proof. 


Theorem 2.3. Leth > 3, € > 3, ands > | be integers, and let A be an infinite 
Bn-1.5 Set of nonnegative integers. If ran(n) = £ for all sufficiently large integers 
n € NA, then there is a positive number ky = k)(A) such that 


£—1)1 
ay gE ee oy 
slogh 
forall x = h. 
Proof. For every positive integer k, let 
I az [hi! nk) 
and 
Ap=AN i. 


The sets {A, :k = 1,2,...} partition A \ {0}. 


Infinite Sumsets with Many Representations 281 


There exists a positive integer 19 such that, ifn > m9 andn € hA, then rg;,(n) > €. 
Because A is infinite, there exists dg € A with hay > no. Choose ko such that 
ay € Af. 

Suppose that k > ko and Ay 4 @. Let 

a, = max(A,). 
Then 
apace Na 
and 
An [ap + 1.h') =9. 
Consider the integer 


hax € hA. 


Because af > do, we have ha; > hay > no, and so ra,(ha;) = £. It follows that 
the set A contains a subset 


{ayj:t=1,...,handj=1,...,€-1} 
such that, forj = 1,2,...,2—1, 
aj < nj (3) 
Qj Sdjy S00 Spy S nj (4) 
ayjtarjt--- + ay-15 + anj = haz 
and 
(41 j,42j,+-+ 54-17, 4nJ) x (aij. doj.+6-.dn—1y’s Any’) 


for 1 <j <j’ < €—1. Moreover, fori = 1,...,h-—1 andj = 1,...,é— 1, the 
inequality a;; > 0 implies that 


ang < ha; 
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Let b < A and let J be a subset of {1,...,€— 1} such that a,; = b for allj € J. 
Ifj € J, then 


Qyjt aaj tes + ap-1j = ha; — ang = ha, — b 
and so 
ran—i (haz — b) = |JI. 
Because A is a Bp_;.5-set, we have 
ran—i(haz — b) < s 
and so |J| < s. The pigeonhole principle implies that 


£-1 


AY 


ang f= 1 onyt = 1}) = 
It follows from inequalities (3) and (4) that 


* 
hay = aj + aaj ++++ + any < han; 


and so 
a, < anj < haz < ee, 
Therefore, 
ay, +1 <anj < pitt, 
Equivalently, 


anj € [ag + 1,h*t!) = [ag + 1,n') U Aga 
forj = 1,2,...,£—1. Because AN [ax - 1h‘) = @, we see that 
{anj if =1,2,...,€-—T} C Angi 
and so 
-1 


L 
|Az+1| = ——. 
AY 


It follows by induction that |A;| > (€ — 1)/s for all k > ky +1. 
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Let x > h, and choose the positive integer f such that 
hi<x<hp't, 
Because 
Co 
A\ {0} = J Ax 
k=l 
it follows that 
A(x) = A(h') = (5+) @— ko) 
f-1 logx 
Pa) (3 — ko 1) 


— (-Ilogx _ 
“~ — slogh ky 


where ky = (£ — 1)(ko + 1)/s. This completes the proof. 
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On the Ratio of Consecutive Gaps 
Between Primes 


Janos Pintz 


Dedicated to Helmut Maier on the occasion of his 60th birthday 


Abstract In the present work we prove a common generalization of Maynard— 
Tao’s recent result about consecutive bounded gaps between primes and of the 
Erdés—Rankin bound about large gaps between consecutive primes. The work 
answers in a strong form a 60-year-old problem of Erdés, which asked whether 
the ratio of two consecutive primegaps can be infinitely often arbitrarily small, and 
arbitrarily large, respectively. This is proved in the paper in a stronger form that not 
only dy, = Pn+1 — Pn can be arbitrarily large compared to d,+ but this remains true 
if d,41 is replaced by the maximum of the k differences d,41,...,d,+x for arbitrary 
fix k. The ratio can reach c(k) times the size of the classical Erd6s—Rankin function 
with a constant c(k) depending only on k. 


1 Introduction 


The difference between the consecutive primes, the expression 


dy = Pnt+1— Pn; (1) 


where Y = {p;}%, denotes the set of primes, has been investigated probably since 


i=1 


the time of the Greeks. The Twin Prime Conjecture asserts 
d, = 2 infinitely often. (2) 
This conjecture settles at the conjectural level the small values of d,,. Concerning 


the large values even the suitable conjecture is not completely clear. However, it 
seems to be that Cramér’s conjecture [7, 8] 
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lim sup 
n—>0o io? n 


=QM=1 (3) 


is near to the truth. Granville suggested [28, 29], just based on the famous matrix 
method of Helmut Maier that the correct value of C is instead of 1 slightly larger 


Ce2e?% > 1, (4) 


However, most mathematicians agree that the correct maximal order of d,, should be 
(log nyt) : 

In the present work we give a short overview of the older and the recent new 
breakthrough results of small gaps between primes, of the present state of art of 
long differences. Our final result will be a common generalization of them (apart 
from the very recent ones proved in the last 6 months) which deals with a problem 
raised first by Erdés and Turan about small and large values of 


d+ 1 
dy 


(5) 


As it will be clear from the history and from the proof of our new result we will 
often refer to important and ingenious results of Helmut Maier, which often (but not 
exclusively) use his famous matrix method. 

Since there were more than 40 papers discussing small and large values of d,, 
(or their relations) we will be relatively brief and try to concentrate on the most 
important developments. 

Due to the prime number theorem, 


N 
_ Pyt+ia2 2 
: 2 ith a lle, (6) 
we have trivially 
d d, 
A := liminf < 1 < limsup —— =: }. (7) 
n>oo logn n>oo logn 


The present work appeared in its first version at the beginning of July 2014 at 
arXiv: 1406.2658 [40], so it is based on our knowledge at that time. The very recent 
dramatic new developments are mentioned in Chapter (A*) of the Introduction. 


(A) Large Gaps Between Primes (Until August 2014) 


The first non-trivial result, A > 1, was reached in 1929, when Backlund [1] proved 
A > 2. One year later Brauer and Zeitz [5] improved this to A > 4. Another year 
later E. Westzynthius [49] showed already A = oo, more precisely 
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dy/\ 
lim sup pte Ree > 2e’, (8) 
n>oo log, n/log,n 


where log,, n denotes the v-fold iterated logarithmic function. 
Erdés [13] improved this in 1935 to 


. d,/logn 
lim su 


—_—__-__ 9 
noo log, n/(log;n)? a 


Finally, 3 years later Rankin [45] succeeded in showing the function which is 
even currently, after more than 75 years the best result, apart from the constant: 


; d,/logn 1 
lim sup >s. 
noo log, nlog,n/(log,n)* ~ 3 


(10) 


Since, apart from the improvements of the constant 1/3 to e” (in works of Ricci, 
Rankin and Schénhage between 1952 and 1963), the inequality (10) remained the 
strongest, Erdés offered in 1979, at a conference in Durham a prize of USD 10,000 
for a proof of (10) with an arbitrarily large constant C in place of 1/3; the highest 
prize ever offered by him for a mathematical problem. However, this did not help 
either in the past 35 years. Nevertheless, strong analytic methods introduced by 
Helmut Maier and Carl Pomerance, combined with the original Erd6s—Rankin sieve 
procedure helped to prove [32] in 1990 (10) with C = 1.3126...e”. 

Finally, the current best result, the relation (10) with 1/3 replaced by 


=e" (1) 


was reached by J. Pintz [36]. The methods to reach it involve besides the classical 
sieve methods and the large sieve used by Helmut Maier and Carl Pomerance, a 
pure graph-theoretical result, which, however, is proved in [36] using a so-called 
semi-random method of E. Szemerédi. 


(A*) Very Recent Developments Concerning Large Gaps 
Between Primes 


It was a great surprise when in August 2014 simultaneously and independently K. 
Ford, B. Green, S. Konyagin, T. Tao [21] and J. Maynard [34] showed the 76-year- 
old conjecture of Erdés mentioned in (A). In other words, this means that there 
exists some function h(n) — oo as n > oo such that 


7 d,/logn 
imsu 
nati h(n) log, n log, n/ (log, n)? 


(12) 
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Four months later in a joint work they showed that (K. Ford, B. Green, 
S. Konyagin, J. Maynard, T. Tao [22]) h(n) can be chosen as log; n, that is 


dn/\ 
fieeip a _ hy (13) 
noo log, log, n/log,n 


(B) Chains of Large Gaps Between Consecutive Primes 


In 1949 Erdés [16] proposed the problem whether k consecutive prime gaps d,+1, 
..+, dyn+x can be simultaneously much larger than its mean value, i.e. whether 


: min(d,+1, erates dn+k) 
lim sup a 
noo log n 


00, (14) 


and succeeded in showing this for k = 2 in the same work. 

Thirty years later Helmut Maier [30] introduced his famous matrix method and 
showed this conjecture even in a stronger form when log n is replaced by the Erd6s— 
Rankin function (cf. (10)). He proved for any k 


; min(dy+1,.--,dn+~)/logn 
lim sup 
nooo _ log, nlog,n/(log; n)* 


> 0. (15) 


(C) Small Gaps Between Consecutive Primes 


Unlike the quick progress in the case of large gaps an analogue of the early result 
A > 4 of Brauer—Zeitz from 1930, i.e. A < 1/4 was essentially the best result still 
even 75 years later before 2005, and it was reached by Helmut Maier [31] also by 
his matrix method in 1985. 

The first, although unpublished and conditional, result was reached by Hardy and 
Littlewood (see [46]) in 1926 who showed that the Generalized Riemann Hypothesis 
(GRH) implies A < 2/3. However, the first non-trivial unconditional result was 
proved by Erdés [14], 


A<1-c, (16) 


with an unspecified explicitly calculable constant c; > 0. 

After much work, calculating and improving c,, the next breakthrough came 
by the large sieve of Bombieri [3] and Vinogradov [48] which enabled Bombieri 
and Davenport [4] to eliminate GRH and (incorporating also Erd6s’ ideas into their 
work) to show unconditionally 


A < (2+ V3)/8 = 0.466... . (17) 
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After five further improvements of Piltjai, Huxley and Fouvry—Grupp this was 
reduced to 0.4342. The next big step was the mentioned result of Helmut Maier 
[31], the inequality 


A < 0.2486. (18) 


Twenty years later D. Goldston, J. Pintz and C. Yildirim succeeded in reaching 
the optimal value 


A=0 (19) 


(see Primes in Tuples I and III in [24, 25]). Further they showed that d,, can be much 
smaller than log n, in fact it was proved in [27] that 


dy, 
lim inf ——— = 0 forany c> 1/2. (20) 
(log n)° 


This was further improved by Pintz [39] for any c > 3/7 which in some sense 
was the limit of the original GPY method as proved by Farkas et al. [20]. 

Furthermore already the first work [24], proving A = 0 in details contained 
also a conditional theorem. In order to formulate it we will describe the notion of 
admissibility for a k-tuple #7 = {h;}_, of distinct integers. 
Definition 1.1. 7 = {h;}'_, (hi 4 h; for i 4 j) is called admissible if for any 
prime p the set # does not occupy all residue classes mod p. 

k 
This is equivalent with the formulation that [] (n+ /;) has no fixed prime divisor. 
i=l 

Remark 1.1. Although we usually suppose that h; > 0 this is clearly not necessary 
in view of the consequence which is translation invariant. 


We also introduced two connected conjectures which were named after Dickson, 
Hardy and Littlewood and which represent actually a weaker form of Dickson’s 
prime k-tuple conjecture [11]. 


Conjecture DHL(k, ko). If % is admissible of size k, then the translated sets n + 
HG, contain for infinitely many n values at least ky primes if N > No(k, ko). 
The special case ky = 2 had special attention because of his connection with the 
Bounded Gap Conjecture. liminfd,, < C with a suitable absolute constant C. 
noo 
The mentioned connection is the simple 


Proposition 1.1. [f Conjecture DHL(k, 2) is true for some k, then the Bounded Gap 
Conjecture is true. 
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To formulate our conditional result we still need the following 


Definition 1.2. A number % is called a level of distribution of primes if for any 
A, &€ > 0 we have 


xX C(A, €)X 
>) max} YS) logp— = \ as (21) 
q<Xx—¢ (4.9)=1 p<X ¢(q) (log ) 
p=a(mod q) 


The result that (21) holds with # = 1/2 is the famous Bombieri—Vinogradov 
theorem [3, 48], while Elliott-Halberstam [12] conjectured that even } = 1 isa 
level of distribution. 

We can introduce for any 3 € (1/2, 1] the 


Conjecture EH(3). (21) is true for J, i.e. 3 is a level of distribution of the primes. 
We showed in our original work 


Theorem 1.1 ((24]). Jf EH(®) is true for any 3} > 1/2, then DHL(k, 2) is true for 
k > C\(#) and consequently, lim inf d, < C2(*). 


Soon after this, Motohashi and Pintz [35] (MR 2414788 (2009d:1132), arXiv: 
math/0602599, Feb 27, 2006) showed in the work entitled “A Smoothed GPY 
sieve” that (21) can be substituted with the weaker condition that it holds for 
smooth moduli of g in (21) (by that we mean that for moduli g having all their 
prime factors below g’—or even X’—with an arbitrarily fixed b) and the maximum 


taken over all residue classes a with (a,q) = 1 can be reduced to those satisfying 
nl (a + h;) = 0 (mod gq) which are trivially the only cases appearing in the proof. 
Finally, Zhang [50] showed that for b = xm the above condition holds with 
v=F+a. 

This led to the very recent result. 


Theorem 1.2 ({50]). DHL(3.5-10°, 2) is true and consequently lim inf d, < 7-10’. 
noo 


Some months later the joint effort of many mathematicians showed a stronger 
form of Zhang’s theorem in the Polymath 8A project led by T. Tao [42]. 


Theorem 1.3 (Polymath 8A). DHL(632, 2) is true and consequently liminfd, < 
noo 
4680. 
Another improved version, proved independently by another refinement of the 
GPY method (in spirit closer to an elementary version of the method worked out 
in collaboration with S. W. Graham, see [26] and which was also close to the first 


attempt of Goldston and Yildirim [23] which finally led to A < 1/4 in their version) 
was reached by Maynard [34], which showed the even stronger 


Theorem 1.4 ((34]). DHL(105, 2) is true and consequently lim inf d,, < 600. 
noo 
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T. Tao used the same approach independently and simultaneously with Maynard 
and with help of his Polymath Project 8B [45] (involving further new theoretical 
ideas and a huge number of computations) they showed 


Theorem 1.5 (Polymath 8B). DHL(50, 2) is true and consequently liminfd, < 
noo 
246. 


It is interesting to note that the above methods did not need the results or ideas of 
Motohashi-Pintz and Zhang, neither any weaker form of a result of type 0 > 1/2. 

In complete contrast to this, the Maynard—Tao method shows the existence of 
infinitely many bounded gaps (although with weaker numerical bounds than 600 
or 246) with any fixed positive distribution level of the primes. The first result of 
this type (that is, a positive distribution level of primes) was proved by Rényi in 
1947-1948 [47]. 


(D) Chains of Bounded Gaps Between Consecutive Primes 


The original GPY method [24] furnished only under the very strong original Elliott— 
Halberstam Conjecture EH(1) the bound 
Pn+2—Pn 


Ao := liminf 22t2 Pe" — 9, (22) 
noo log Dn 


Thus it failed to show on EH even DHL(k, 3) for some k. The additional ideas of 
Motohashi-—Pintz and Zhang helped neither. 
Already Erdés mentioned [16] as a conjecture 


fining ee 
n—>oo log n 


1 (23) 


which was proved in the mentioned work of Helmut Maier [31]. In the same work 
he has shown 


n+r— Pn = 5 
A. Stat ts oe (r- ze o(t)) (r > 00). (24) 
noo logn 8 
This was improved in [25] to 
A, <e? (Vr—1)’. (25) 


However, as mentioned already, even (22) was open unconditionally. In this aspect 
the Maynard—Tao method was much more successful, yielding 
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Theorem 1.6 (Maynard-Tao). liminf(pn+- — Pn) < Ce” for any r with an 
n—->Co 


absolute constant C. 


2 Consecutive Values of d,,: Problems of Erdés, 
Turan, and Polya 


The questions discussed in Sect. 1 referred to single or consecutive small values of 
d, or to analogous problems dealing with solely large values of d,. In 1948 Erdés 
and Turan [19] showed that d,41 — d, changes sign infinitely often. After this, still 
in the same year, Erdés [15] proved that 


dn : dn 
liminf “** <1 <lim sup aisle (26) 


He mentioned 60 years ago [17]: “One would of course conjecture that 


dy, . dy 
liminf “+ = 0 and limsup “+ = on, (27) 
noo n noo n 


but these conjectures seem very difficult to prove.” Based on a generalization of the 
method of Zhang [50] the author proved (27) in [37]. 

In the mentioned work of Erdés and Turan [19] they also asked for a necessary 
and sufficient condition that 


k 


Ss GiPnti (28) 


i=1 


should change sign infinitely often as n — oo. They observed that the condition 


k 
ya; =0 (29) 
i=1 


is clearly necessary. Using (29) one can reformulate the problem and ask for 
infinitely many sign changes of 


l k 
ys d+ = — >: AiPnti (30) 
i=1 i=1 


if we use the notation 
Mi 
a= Soa G=1,2,...,.k-1), €=k-1. (31) 


i=1 
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This form shows an observation of Pélya (see [18]) according to which a; (j = 
1,...,4€) cannot all have the same sign if (28) has infinitely many sign changes. 
Erdés [18] writes: “It would be reasonable to conjecture that Pélya’s condition is 
necessary and sufficient for (30) to change sign infinitely often. Unfortunately the 
proof of this is not likely to succeed at the present state of science.” 

After this Erdés showed [18] that (28), i.e. (30) changes sign infinitely often if 


£ 
Sia =0, a £0. (32) 
i=1 


The author announced in [38] the proof of the following 


Theorem 2.7 ([38]).. The sum (28), i.e. (30), changes sign infinitely often if at least 
one of the following conditions holds (£ > 2) 


£ 
Yo ail < coll) > lal, 
i=l i=1 
with a sufficiently small explicitly calculable constant co(£) depending on £; 
(ii) if aj € [1, €] such that 


£ 


(i) 


£ 
> Jail <lajl, sgnar A sgnoy, ie [1,4] \j; (33) 
i=1iAj 


(iii) if the Hardy—Littlewood prime k-tuple conjecture is true for k = &. 


Now, (iii) shows that the mentioned conjecture of Erdés, namely that Pélya’s 
trivial necessary condition (i.e., that all a; cannot have the same sign) is probably 
really a necessary and sufficient condition for (30) to change sign infinitely often. 


3 Results 


In the present work we will show a kind of improvement of the result (10), which 
can be considered also as a common generalization of our result (27) and Maynard— 
Tao’s mentioned theorem about the strongest known estimates of chains of small 
gaps between consecutive primes. The result will also give an improvement on the 
recent proof of the author [37] which solved the 60-year-old problem (27) of Erdés 
and improves the result (14) for k = 2 due to Erdés [16]. 

The proof will also show a simple method (already observed in [37] by the author 
after the proof of Zhang [50]) which makes the producing of bounded gaps (or 
chains of bounded gaps in case of Maynard and Tao) effective, since the original 
versions used a Bombieri—Vinogradov type theorem which again made use of the 
ineffective Siegel—Walfisz theorem. 
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Remark 3.2. The recent work [2] shows implicitly a way to make the Maynard—Tao 
theorem effective. 


Finally we mention that concerning the Theorem [37] of Sect.2 we can fur- 
ther give a very simple sufficient condition for (28), i.e.(30) to change sign 
infinitely often. To simplify the problem we can clearly suppose a; 4 0, ae 4 0. In 
this case the sufficient condition is simply: 


sena, ~ sgn ae. (34) 


(if we do not make the trivial supposition a; 4 0, a¢ 4 0 we can clearly formulate 
it in the way that the first and last non-zero elements of the sequence {or}, should 
have opposite sign.) 

Summarizing, we will prove the following results, using the basic notation d, = 
Pn+1 — Pn Of (1), log, n for the v-fold iterated logarithmic function. 


Theorem 3.8. Let k be an arbitrary fixed natural number, c(k), N(k) suitable 
positive, explicitly calculable constants depending only on k. Then for any N > N(k) 
there exists ann € [N, 2N] such that 


dn+1 5 c(k) log N log, N log, N 


35 
max(d,,...,@y)—441) (log, N)? (39) 


Essentially the same proof gives the analogous result: 
Theorem 3.8’. Under the conditions of Theorem 3.8 we have 


dn—k c(k) log N log, N log, N 


> 36 
max(dy,...,dn—K+1) (log, N)? (36) 


Theorem 3.9. Let ko be an arbitrary fixed natural number, c(ko), N(ko) suitable 
positive, explicitly calculable constants depending only on ko. Then we have ak = ko 
such that for any N > N(ko) there exists ann € [N,2N] such that 


min(d,—x, dn+1) " c(ko) log N log, N log, N 
max(dh, or) dn—k-+1) (log; N)? 


Theorem 3.10. Let € > 2 be an arbitrary integer, a,...,a@¢ real numbers with 
a, #0, a F 0. Then the expression 


L 
= idn+i (37) 
i=1 


changes sign infinitely often as n — o if 


sgna, # sgnae. (38) 
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We remark that Theorem 3.10 trivially follows from Theorems 3.8 and 3.8’. 


4 Proofs 


The proof requires a combination of the methods of Erdés—Rankin and that of 
Maynard—Tao. Since Theorem 3.10 follows from Theorems 3.8 and 3.8’, further 
Theorem 3.9 implies Theorems 3.8 and 3.8’, it is sufficient to prove Theorem 3.9. 
Concerning Theorem 3.9 we will show that we will have infinitely many cases when 
a block of at least kg consecutive primes in a bounded interval are preceded and 
followed by two primegaps of length at least 


co(ko) log N log, N log, N 
(log; N)? 


‘= co(ko) log Nf (log N) (39) 


each, where the corresponding primes p',s satisfy v € [N, 2N]. To be more specific 
we will choose an arbitrary set of m different primes {h;}_, with the property that 
for any i,j,t € [1,m],i xj: 

m < C3(m) < hy < hy <+++ <n < Ca(m), m= [Cse], hy + hi—hy (40) 


with the absolute constants Cs, Cs and the constants C3(m) and C4(m) depending 
on m to be chosen later. Denoting 


M= I] Pi (41) 


DiSR, pixh; (j=1....m) 
we try to determine a residue class z ( mod M) with the property 
(Z+h;,M)=1 (<i<m), (42) 
(z+ v,M) > 1 forall v € [0, co(ko)RF(R)] \ {hi}. (43) 
We remark that we will have Rf(R) = log Nf (log N). 
To describe the procedure briefly we will look for a block of at least ky primes 
among the numbers 


L+h,...,6+hm, £€=z(mod M) (44) 


by the Maynard—Tao method, using the refinements in [2]. This will contain at least 
ko consecutive primes in a bounded block, while the two intervals 


[¢ — co(ko)RF(R), € — Im) and (€ + hm, € + co(ko)RF(R)] (45) 
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will certainly not contain any prime due to (43). Since (cf. [2]) we can choose M as 
large as a suitable small power of N, this will prove our theorem as by the Prime 
Number Theorem we will have 


logN x logM~ R, f(logN) ~ f(R) (46) 


and consequently Rf(R) = log Nf (log N). 
Let us use parameters 0 < v < w < R/2 to be chosen later, and let 


Pi: = I] D, (47) 
pSv, p#hj (1<j<m) 
P,:= || p. (48) 
v<psw 
P3:= |] p. (49) 
w<p<R/2 
Pai= TT p, (50) 
R/2<p<R 
v = log? R, U = co(ko)Rf(R), P := P(R) = P\ P2P3Pa, (51) 


with a suitably small c(ko), chosen at the end. If we have a Siegel-zero, that is, 
a real primitive character y; modr,r < N, with a zero f of the corresponding 
£-function #(s, x) satisfying 


C7 
log N’ 


B>l (52) 


then this character and zero are uniquely determined by the Landau—Page theorem 
(see, e.g., [9, p. 95]) if c7 is chosen as a suitably small explicitly calculable positive 
absolute constant. If y; is real primitive mod r, then r has to be square-free apart 
from the possibility that the prime 2 appears on the second or third power in r. Let 
us denote the greatest prime factor of r by g. Since we have 


log r 
Jr 


(see [9, p. 96]) where the implied constant is explicitly calculable we have by (52) 


1-B> (53) 


r > (log N)*”. (54) 
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Consequently, by the Prime Number Theorem and the “almost square-free” property 
of r we have 


q > logr > log, N > oo as N > ow. (55) 


If there is no Siegel-zero, then we will let g = 1. Now we change slightly the 
definition of P; and P as to exclude from their defining product the possible divisor q. 
Let 


4 
P? = Pi/q if q| P) (1 <i< 4), otherwise P? = P;, P* =| | Pr. (56) 


i=1 


This change is not necessary for finding z with (42)-(43) but to assure uniform 
distribution of primes = z ( mod M) in the Maynard-Tao procedure (and to make 
later the whole procedure effective as well). This means also that in contrast with 
the rough description at the beginning of this section the role of M = P will be 
played actually by 


M* = P* = P/q if q| P (otherwise M* = M = P= P*). (57) 
Returning with this change to our problem of choosing z mod P* let 
z = 0 (mod Pf P3) (58) 


while we will choose z (mod P} P73) suitably later. The letter p will denote in the 
following an unspecified prime. The choices of v and U in (51) guarantee that 


GteP P= 1 O<2=0,"F71) (59) 
if and only if either 
n= pq" | [he (a= 0, a; > 0) and p> R/2 (60) 
i=l 
or 
n is composed only of primes p | P>q I] hi. (61) 


i=1 


Our first (and main) goal in finding z mod P* will be to find residues a, with 
p | P3 so that choosing 


Zz =a, (mod p) for p | P5 (62) 
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the condition (z* + h;, P*) = 1 (cf. (42)) should remain true and simultaneously we 
should have for as many as possible numbers n of the form (60) (z+, P*) > 1. By 
a suitable choice of the parameter w we can make the whole set of n’s satisfying (61) 
relatively small, due to de Bruijn’s result [10] which we use in a weaker form, proved 
by Rankin [45]. The present form is Lemma 5 of [30]. 


Lemma 4.1. Let W(x, y) denote the number of positive integers n < x which are 
composed only of primes p < y. For y < x and y approaching to infinity with x, we 
have 


lo lo 
W(x, y) < xexp |- Pe tog + tog,» +0 ( P2*) : (63) 
3 


Since W(x, y) is clearly monotonically increasing in x we can estimate the number 
of n’s in (61) from above by 


(log uy W(U,w). (64) 
Now, choosing 
w= exp[a (log R log, R/ log, R)| (65) 


with a constant a to be determined later, the quantity in (64) is by (51) clearly 


(1 + o(1)) log, RlogR 
U _ 1 1)) log, R 1)log,R 
« exp | wipe Rlog R/ loz, + (1 + o(1)) log, R + (m + 1) log, 
U R 
< =0 66 
log’ U C3, os 


if we chose, e.g., 


1 
en rat (67) 

This bound will be completely satisfactory for us. So we have to concentrate on 
the numbers 7 in (60). 

The simplest strategy in choosing a, ( mod p) in (62) would be to sieve out for 
the smallest p > v the largest residue class mod p among the No numbers of the 
form (60) and repeat this consecutively for all primes in P}. What makes our task 
more complicated, is the fact that we have to preserve the condition (z+h;,M) = 1 
in (42), so we have to choose for any p; € P5 


a, # —h; (mod p;) for i= 1,2,...,m. (68) 
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First we observe that an easy calculation shows that the Prime Number Theorem 
implies 


U R 
No > a m els m 
1,0] ,...,@m>0 q* I] hj 2q% qT] h;i 
i=1 i=1 
U m 1 1 
~ 1+-+ 2 1+-—-+-—+ 
(leds) tede) 
U yo 1\7! -.2au 
% i) fi 2) = (69) 
logU q h; log U 


i=1 


if C3(m) in (40) was chosen sufficiently large (taking into account also (55)). 
We will choose the residue classes a, one by one for all primes from v to w 
and consider at the ith step the arising situation. We will be left at the jth step with 


Nj; < No remaining values of n’s with (60). If Nj is at any stage of size < Sloe R’ 
0g 
then we are ready. Thus we can suppose Nj > R/(5 log R). 
Otherwise, if w and all aw; (1 < i < m) are determined, then we have trivially in 


case of 


g’ | [ati > VU (70) 


i=l 
at any rate altogether at most 

0 (VUtlog v)"*") (71) 
numbers of the given form (60). 


On the other hand, if (70) is not true, then we have by the Brun—Titchmarsh 
theorem altogether at most 


3mU 8U, 
= a (72) 
9 (Pia" i ni) log JU pig” ( II hi) log U 
i= i=l 
numbers n of form (60) which lie in one of the m bad residue classes {h;}7_, mod pj. 


Adding (72) up for all possible non-negative combination of @, a1,...,Q@m we get 
altogether (cf. (69)) at most 


16mU 7 16mco(ko)RF(R) 7 Nif (R) Z Nj 
pjlogU pjlogR Dj 4log*R 


(73) 
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bad n values of the form (60) by pj > w and N; > R/(5logR). This means that 
choosing from the remaining p; — m residue classes that one which sieves out the 
most elements from the remaining set of size N; we obtain for the size of the new 


remaining set 
1 
Nj ( ! 4 log? z) 


Njti < Nj - 
J+ J pj —m 

ie 

< Nj l- 2105 R 

Pj 

1 1—log~?R 

<nj(1- =) ; (74) 
Pj 


This means that by Mertens’ theorem we obtain a final residual set of cardinality at 


most 
1 1—log~?R log v 1—log~?R 
N* <M |] (1--) ~No( = ) 
Dp log w 


u<p<w 


Pq 


1—-log~?R 
=n 3 logs R(1 + o(1)) 
=o alogRlog,R 


log5 R 8mU_ It 
< 4mNo = ; 
logRlog,R ~ logU_ f(R) 
8mco(ko)R — 8mco(ko)R — 2(R) — 2(R/2) 


log U logR < 4 oe) 


if the value of co(ko) in (39) was chosen sufficiently small. 

Formulas (66) and (69) mean that after the sieving with suitably chosen Oj, 
(mod p;), p; € P> and taking into account that the set satisfying (61) was at any 
rate small, all elements of (60) and (61) can be sieved out by one third of the primes 


from P,4 choosing for every remaining n* = pq® I h;“ in (60) and (61) a separate 


p* € Ps, p* # p,qand z = —n* ( mod p*), ‘hanis. oes = —n* (mod p*). This will 
assure (z-+n*, M*) > 1. With an other third of p* | P4 we can similarly sieve out the 
remaining negative n* values. The only problem is that for any p* € P4 used above, 
simultaneously with z+ n* = 0 ( mod p*) we have to assure z + h; 0 ( mod p*). 
This is equivalent to that n* — h; = 0 ( mod p*) is not allowed if we want to sieve 
out z + n* with p*. However, for every number |n*| < U and for every i € [l,m] 
the number n* — h; has at most one prime divisor > R/2 > JU. This means that 
for every n* we have at most m forbidden primes to use to sieve out n*. Thus, as 
the number of still “abundant” primes is at every step at least one third of all primes 
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between (R/2, R] we can for all (positive and negative) values of n* consecutively 
choose the primes p* | Pj nearly freely, just avoiding at most m forbidden primes 
at each step, so that we would have, step by step for each p* | Pj, n* 


z+n* =0(modp*), z+h;#0(modp*) (= 1,2,...,m). (76) 


At the end of this procedure some p* | P4 will remain. We can choose for these 
primes a, freely with the only condition that 


a, # —h; (mod p*) (i= 1,2,...,m) (77) 
should hold, in order to assure for the remaining primes p* | P4 also 
(z+ hj,p*) = 1. (78) 


Finally to determine z uniquely mod M/q we choose for the primes p = hj ap 
again essentially freely with the only condition 


ap #—h;(modp) (j= 1,2,...,m) (79) 


which is again possible by h,; > m. In this way we will have finally for any i = 
1,2,...,m with M* = P* =M/q= P/q 


(2+ hi, P*) =1 (80) 

and this means that the relations (42)—(43) will be satisfied with M* in place of M. 

Now, we continue with the Maynard—Tao proof which we take over from [34] 

with the additional changes executed in [2]. We will look for primes among the 
numbers 


L+hy,...,€+)m, €=z(modW), W=M* = P* (81) 


with the value z found in the previous procedure, satisfying with the notation “, = 
{hi an 


(c+h;,W) =1 (= 1,2,...,m) (82) 
(z+ v,W) > 1 if |v| € [0, log Lf(logL)] \ Mn. (83) 


We will show that for L > L(ko) we will find numbers 
€e[L,2L], €=z(modW), #Hl+h € AI <i<m)} =k (84) 


with (81)-(83) which will prove our Theorem 3.9. 
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Remark 4.3. If there are additional primes among £ — h; (1 < i < m) this does not 
change anything since h; < C4(m) < Cg(ko). The introduction of the new variables 
£ and L is only necessary since we look for primes around ¢ instead of investigating 
Pn and d,, so essentially £ ~ nlogn, L ~ NlogN, logL ~ logN, so the size of 
gaps (39) remains unchanged if we substitute NV by L. We remark that (40) and (55) 
assure 


P| | [@i-h) = p = 6n(1), p Hh, (1 St < m) => pt W, (85) 
ij=l 
fi 
analogously to (65)-(69) of Banks et al. [2], which is actually the Maynard—Tao 
theorem. 

We need also the modified Bombieri—-Vinogradov theorem, Theorem 4.1 of 
Banks et al. [2], which is somewhat similar to, but stronger than Theorem 6 of 
[27]. The greatest prime factor of W is < log L, so the level of smoothness of W 
is much better than that (L°) required by the condition of Theorem 10 of Chang 
[6]. So the whole Theorem 4.1 of Banks et al. [2] will remain true. Further, we can 
leave out from the actual sieving procedure the possibly existing g and its multiples. 


O(1) 
1 

This causes just a negligible change of size { 1 — — =1+0 cet) in the 
te) 


q 
weighted number of primes and in the sum of weights. If we choose in (41) 
R < co(ko) log L (86) 


with a sufficiently small co(ko) > 0, then we will have by the Prime Number 
Theorem 


W< Loko A+0(1)) (87) 


so the whole Maynard—Tao procedure will remain valid, as in [2]. (The variable R 
is here completely different from that in [34] or [2].) 


Summarizing, we will obtain at least ky) bounded prime gaps in intervals of type 
[6 — lim, £ + hm] (88) 


and around them two intervals of size (1 + 0(1))co(ko) log Llog, L log, L/(log3 L) 
containing only composite numbers (see (45)) which prove Theorem 3.9. 

Finally, the fact that we left out the largest prime factor g (and its multi- 
ples) of the possibly existing exceptional modulus r yields an effective modified 
Bombieri—Vinogradov theorem and so an effective final Theorem 3.9, consequently 
Theorems 3.8, 3.8’, and 3.10 are effective too. 
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Remarks on Fibers of the Sum-of-Divisors 
Function 


Paul Pollack 


To Professor Helmut Maier on his 60th birthday 


Abstract Let o denote the usual sum-of-divisors function. We show that every 
positive real number can be approximated arbitrarily closely by a fraction m/n with 
o(m) = o(n). This answers in the affirmative a question of Erdés. We also show 
that for almost all of the elements v of o(N), the members of the fiber o~!(v) all 
share the same largest prime factor. We describe an application of the second result 
to the theory of L.E. Dickson’s amicable tuples, which are a generalization of the 
ancient notion of an amicable pair. 


1 Introduction 


Let o(n) := Doda d be the familiar sum-of-divisors function. In this paper, we 
record two theorems concerning the fibers of o. The first of these answers in the 
affirmative a 1959 question of Erdés (see [5, p. 172]). 


Theorem 1.1. Let 6 > 0. For every « > 0, one can find integers m and n with 
o(m) = o(n) and |" — B| <e. 


The primary tool in the proof will be the remarkable recent theorem of Yitang Zhang 
approximating the prime k-tuples conjecture. 
Our second result concerns the multiplicative structure of elements belonging to 
a typical fiber. Building on work of Maier and Pomerance [8], Ford [6] developed 
an extensive theory of o-preimages and used it to answer a number of delicate 
questions about the distribution of g and o-values. For example, he showed that 
the count of o-values in [1, x] is 
_ * 


1 
ieee exp (C(log; x — log, x)? + Dlog,x— (D+ x= 2C) log, x) 
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for certain constants C ~ 0.8178146 and D ~ 2.1769687, and that precisely the 
same estimate holds for the g-function. Here we show how Ford’s methods can be 
adapted to prove the following theorem. 


Theorem 1.2. For asymptotically 100% of the values v in the image of the 
o-function, all of the elements of the set o—'(v) share the same largest prime factor. 


“Asymptotically 100 %” means that the density of such v, relative to o(N), is 1. 

Theorem 1.2 has an amusing consequence for a problem of L.E. Dickson. Recall 
that m and n form an amicable pair if o(m) = a(n) = m +n. Dickson [1] (cf. [2, 
p. 50]) proposed the following generalization: Say that m,,...,n, form an amicable 
k-tuple if o(n;) = ny +no+...+n, for eachi € {1,2,...,k}. (Below, we refer to v 
as the common o-value of the amicable tuple.) Dickson gave a handful of examples 
with k = 3. For several others, with k as large as 6, see [9, 10, 15]. 

The distribution of amicable tuples remains mysterious. For example, we do 
not know if there are infinitely many amicable tuples, even if all k are considered 
simultaneously. In the case k = 2 (the amicable pair case), some progress has been 
made. In 1955, Erdés [4] showed that the set of natural numbers belonging to an 
amicable pair has density zero. This result has been steadily sharpened over the 
years [3, 12-14]. However, when k > 2, we still do not know if the set of numbers 
belonging to an amicable k-tuple has density zero. Thus, the following corollary of 
Theorem 1.2 seems of some interest. 


Corollary 1.1. Asymptotically 0% of the elements in the range of the o-function 
appear as the common o-value of an amicable tuple. 


1.1 Notation 


We reserve the letters p and q for primes. We let P*(n) denote the largest prime 
factor of n, with the convention that Pt(1) = 1. We use w(n) for the number of 
distinct primes dividing n and §2(n) for the number of prime factors of n counted 
with multiplicity. We write 2(n, U,7T) for the number of primes dividing n with 
U < p <T, again counted with multiplicity. We write log, for the kth iterate of the 
natural logarithm. 


2 Proof of Theorem 1.1 


Let L be the closure of the set {log 7 : o(m) = o(n)}. Theorem 1.1 amounts to the 
claim that L = R. Since log * = — log <, the set L is symmetric about 0, and so it 
is enough to show that L contains all nonnegative real numbers. 
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For any finite set of primes Y, define L” in the same way as L but with m 
and n restricted to be divisible by none of the primes in Y. The next lemma is 
fundamental. 


Lemma 2.1. There is a natural number K for which the following holds: Let P be 
a finite set of primes. Let a, < ... < ax be any K real numbers. Then a; — a; € L? 
for some pair ofiandj with <i<j<K. 


The proof requires that we recall Dickson’s prime k-tuples conjecture and the 
spectacular recent progress towards it made by Zhang [18]. A collection of linear 
polynomials ayx + bi, ..., apex + by € Zlx], each with positive leading coefficient, 
is called admissible if ged{] [k_, (ain + bi)}nez = 1. Dickson conjectured that for 
any admissible collection, there are infinitely many n for which all of the ajn + b; 
are simultaneously prime. 

Zhang’s breakthrough result in this direction was the following: 


Proposition 2.1. There is a natural number K for which the following holds: 
Suppose that a,x + bi, aox + bo, ..., Axx + bx is admissible. For some pair of 
iand j with | <i <j < K, the expressions ajn + b; and ajn + b; simultaneously 
represent primes for infinitely many natural numbers n. 


Zhang states his result only in the case when all a; = 1. A stronger version of 
the proposition—explicitly stated for general linear forms—appears in recent work 
of Maynard (see [11, Theorem 3.1]). 


Proof (Lemma 2.1). Assuming Dickson’s conjecture, Schinzel and Sierpinski [16, 
p. 193] showed that there are solutions to 0(m) = a(n) with the ratio m/n arbitrarily 
large. We modify their approach to demonstrate Lemma 2.1. Our proof will show 
that if K is acceptable in Proposition 2.1, then it is also acceptable in Lemma 2.1. 

The set {log ow) : gced(n, I],¢xP) = 1} is dense in [0, oo). Indeed, if f is any 
nonnegative additive function for which (1) Es f(p) diverges, and (2) f(p) > 0 
along primes p, then the values of f(n), with n squarefree, are dense in [0, 00). 
(This follows from a straightforward application of the greedy algorithm.) We apply 
this general fact with f any additive function having f(p) = 0 for pp € # and 
f(p) = log a forp ZF. 

We can assume all a; > 0, by replacing each a; with a; + ao for a suitably large 
do. For each 1 < i < K, fix a sequence {Ay pas of integers coprime to Ipc @P 
satisfying 


(i) 
o(A;") = 


— = a. 
1G) : 
A; 


lim log 
Joo 


For each j € N, we apply Proposition 2.1 to the collection {o(A” )x — 1h). 
This is an admissible collection, since the product of the polynomials at x = 0 is 
+1. By Proposition 2.1, we can find a natural number nj, and integers 1 < aj < 


bj < K, for which p; := o(A)n, — land gj := o(AW nj — | are simultaneously 
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prime. Moreover, we can assume that p; and q; are both larger than j, larger than 


AW (K) 


Ge Rees Aj , and larger than any element of Y. Observe that 


(qj 
pj+ 1 _ o(A; ”) 
q+ oA) 


(bj) (aj) 
so that o(pjA;"") = 0 (qjAj" ); 


also, both pay? and qiA are prime to Hex p. There are only (5) possibilities 
for the pair (a;, bj), and so some choice (a, b) must be taken on for infinitely many 
j. AS j — oo through corresponding values, 


gia,” (qj + DA 


o(A”) o(A”) 
@ — 08 ©) 5 10 2 
DA; (p; + 1A; 


g 
(b) (a) 
Aj Aj 


log 


+ o(1) = log 


+ o(1), 


which gives that a, — a, € L”. 


Proof (Theorem 1.1). We show that Rs is contained in L. Take any a > 0. 

Let « > 0. Apply Lemma 2.1 with Y = @ and q,...,a,% chosen as 
0, re, bins Ate, We find that one of te, ze, ae Ae € L; in particular, LN 
(5z:€) # . Thus, we can choose m, and nm with o(m) = o(m) and 57 < 
log 7 < ¢. Suppose we have already defined m; and n;. Apply Lemma 2.1 
with the same a 1,...,a@, but with F the set of primes dividing i mjn;. We 
find that there are natural numbers mj+1 and noi with o(m4i) = o(nj41), 
ged(mp4injti. Pa mjn;) = |, and ae < log and 
to produce infinite sequences {m,} and {nj}. 


‘ mp ‘ J mp : 
Since each log a = 3g, we may choose J with i log 7 > a. Moreover, if 


< ¢€. We continue ad infinitum 


J is chosen minimally, thena < » log <a+e. With m := Tee m, and 


ty 
n:= ie nj, we see that o(m) = o(n) and 0 < log” —a <e. 
Since € > 0 was arbitrary, a € L. 


Remark 2.1, Ford’s methods (see the proof of the lower bound in [6]) show that 
given any fiber 0 !(v) = {n,...,m}, a positive proportion of all fibers o—'(w) 
have the form {dn,..., dn,} for some natural number d. Clearly, dilating by a factor 
of d does not change the ratios between elements of a set. Thus, not only is every 
B > O well approximable by a ratio m/n, where o(m) = a(n), but it is not so 
unusual to see a ratio close to 6. For example, a positive proportion of v € a(N) 
have two preimages m and n with |" — z| < to, 


Remark 2.2. It seems interesting to observe that in the statement of Theorem 1.1, 
m and n can be taken to be coprime. Indeed, the argument given above produces 
squarefree integers m and n. Since o(m) = o(n), if we write m/n = m’' /n' in lowest 
terms, then also o(m’) = a(n’). 
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Remark 2.3. With obvious modifications, our argument will show that Theorem 1.1 
also holds with g replacing o. In the same source [5] where Erdés mentions the 
problem for o, he claims that this g-variant can be handled in an elementary fashion. 


3 Proof of Theorem 1.2 


3.1 Overview of the Basic Method 


The proof follows the method of Ford and Pollack [7] for showing that most 
y-values are not o-values, and vice versa. We review the strategy of that argument 
here. For f € {y,o}, let V-(x) be the number of f-values belonging to [1, x]. As 
already alluded to in the introduction, one knows [6, Theorem 14] that V,(x) = 
V,(x) for x > 1; thus, the main result of Ford and Pollack [7] follows if it is shown 
that the number of common ¢, o values in [1, x] is o(V,(x) + V,(x)). 

To this end, one begins by constructing [7, Sect.3] sets of and .%, with the 
property that almost all f-values in [1,x] have all their f-preimages in .%. The 
precise definition of the sets .% [7, p. 1679] is quite intricate and incorporates both 
“anatomical” and “structural” constraints. By “anatomical,” we mean multiplicative 
constraints of the sort that often arise in elementary number theory. For example, 
we insist that for a € .%, neither a nor f(a) has an extraordinarily large squarefull 
divisor or “too many” prime divisors. Chief among the anatomical constraints is 
the requirement that every prime p dividing an element of .% be a normal prime, 
meaning that the prime divisors of both p — 1 and p + | are roughly uniformly 
distributed on a double-logarithmic scale. 

By “structural,’ we mean that extensive use is made of the results of Ford 
[6] describing the fine structure of typical f-values and their preimages. As an 
example, precise inequalities are imposed on the prime divisors of elements of .%; 
the ordered list of such primes, after a double-logarithmic rescaling, must (up to a 
small error) correspond to a point in the fundamental simplex of Ford [6, Sect. 3]. In 
addition, we require—and this is the main innovation of Ford and Pollack [7]—that 
a particular linear combination of renormalized prime factors be slightly less than 1 
(this is condition (8) below in the definition of the set .%,). This ensures that sieve 
bounds (such as those that feature in Lemma 3.2 below) eventually yield a nontrivial 
estimate. 

Having constructed such sets .%, it is enough to study how many common 9, o 
values appear as solutions to an equation of the form 


g(a) =o(a’), where a€GaewG. (1) 
Write a = popip2::: and a’ = qoqiq2---, where the sequences of primes p; and q; 


are nonincreasing. The normality condition in the definition of the sets .% implies 
that for small values of i, we have p; ~ qj, at least on a double logarithmic scale. 
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We classify the primes p; and q; dividing a and a’ into three categories: “large,” 
“small,” and “tiny” (as described in [7, §5A]). Then (1) gives rise to an equation of 
the form 


(Po — 1) (1 — 1) +++ (pee — Dfd = (Go + Digi +: 1)+++ (Gee + De. (2) 


Here po, ..-,Px—1 and qo,...,Gx—1 are the large primes in a and a’ (respectively), f 
is the contribution to g(a) of the small primes, d is the contribution to g(a) of the 
tiny primes, and e is the contribution to o(a’) of both the small and tiny primes. 

To finish the argument, we require an estimate for the number of solutions to 
equations of the form (2). We prove a lemma ([7, Lemma 4.1], cf. Lemma 3.2 
below) counting the number of solutions po,...,Px—1, Jo, ---, Gk-1, , f to possible 
equations of the form (2), given d and given intervals encoding the rough location 
of the primes p; and q;. (The phrase “possible equations” means that there are 
many further technical hypotheses in the lemma, but that these hypotheses are 
automatically satisfied because of our choice of the sets .%.) Finally, we sum the 
estimate of the lemma over all possible values of d and all possible selections of 
intervals; this allows us to show [7, p. 1695] that 


1 
#{p(a) : (a,a’) € G x G, and g(a) = a(a’)} K — exp (- F008, at?) : 
ogx 


which is o(V,(x) + V,(x)) with much room to spare. 


3.2 Definition of the Set 


Theorem 1.2 will be proved by modifying the above procedure. We start by giving 
a careful definition of the set % appearing above. The set .%, can be defined in an 
entirely similar manner, but we will not need this. 

Put 


(oe) 


n+l 
F(z) = ax where each a, = i log t dt. (3) 


n=1 


The series defines F as a continuous, increasing function of z on (0, 1). Moreover, 
F(z) > cwasz t 1. Hence, there is a unique g € (0, 1) with F(@) = 1; numerically, 
Oo © 0.5426. We let C = which is + 0.8178. (We met this constant already 
in the introduction.) 

Given a natural number n, write n = po(n)p1(n)p2(n) ..., where po(n) = pi(n) = 
p2(n) = ... are the primes dividing n (with multiplicity). Put 


1 
2| log ol” 


tis log, pi(n)/log,x if i < 92(n) and p;(n) > 2, 
x(n; x) = 
otherwise. 
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For each real number L and each € = (&,...,&-2), we let .“,(&) be the set of 
(x1,...,xz) € R with 0 < x, < xz) <... <x; < 1, and satisfying the system of 
inequalities 


AX, +aox2 +... + appx, < & 


Os Poe FOL, = Eiri 


ayxXp—1 + aoxy < Er-2X1-2. 


The region corresponding to € = (1,1,..., 1) is called the L-dimensional funda- 
mental simplex. Let 


Lo(x) = |2C(log, x — log, x); 


when x is clear from context, we abbreviate Lo(x) to Lo. One of the key observations 
of Ford [6] is that if the components of & are slightly larger than 1, and L is a 
little smaller than Lo, then the n for which (x) (n;x),...,x,(";x)) € “%,(&) generate 
almost all o-values in [1, x]. (See condition (5) below in the definition of .<% for one 
way of making this precise.) 

Let S > 2. We say that a prime p is S-normal if 82(p + 1,1,S) < 2 log, S and, 
for every pair of real numbers (U,T) withS < U<T<p+l, 


|\2Q(p + 1, U,T) — Cog, T — log, U)| < log, Slog, T. 


In other words, there are not exorbitantly many prime divisors of p + 1 up to S, 
and the larger prime divisors of p + 1 are roughly uniformly distributed on a double 
logarithmic scale. (In [7], the definition of S-normal required the same constraints 
also on p — 1, but for this paper working with p + 1 is sufficient.) 

We can now give a precise definition of the set .%. Fix € € (0, 1/2) and assume 
throughout that x > xo(€). Let 


S = exp((log, x)*°),  w = (log, x) ?*3, 

1 
L=|lyo-2-vV1 ; d §=1+ —— ,. 
lo 2fogsxJ. and § = 1+ TE 
Then <& = &%(e,x) is the set of n = po(n)pi(n)... with o(n) < x that satisfy 
all of 


(0) n> x/logx, 

(1) every squarefull divisor m of n or of o(n) has m < (log x)?, 
(2) all of the primes p,(7) are S-normal, 

(3) 2Q(o(n)) < 10log, x and 2(n) < 10 log, x, 
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(4) if d || n and d > exp((logx)!/”), then Q(a(d)) < 10log, o(d), 

(5) (173 x),..., x10 x) € AL), 

(6) nhas at least L + 1 odd prime divisors (counted with multiplicity), 
(7) Pt (po 4 1) > x!/log2*, n(n) < x)/(100 logs x) 

(8) ayx\(n;x) +... + apxy(n;x) < 1—-o@. 


The following statement appears as [7, Lemma 3.2]. 
Proposition 3.2. The number of o-values in [1, x] which have a preimage n ¢ f, is 
—}te 
<« V,(x) (log, x)" 2°. 


This makes precise our claim in the overview that the elements of <% generate 
almost all o-values in [1, x]. 


3.3 The Proof Proper 


Say that the o-value v is exceptional if it possesses two preimages a and a’ for which 
Pt (a) 4 Pt (d’). We wish to show that the count of exceptional o-values in [1, x] is 
o(V,(x)), as x > oo. In view of Proposition 3.2, we may assume v is such that all 
of its preimages belong to .%,. Pick preimages a and a’ with Pt (a) 4 Pt (a’), and 
write 


G=Ppopip2...,. and a’ = qoqiq...., 


where p; and gq; are nonincreasing sequences. 


3.3.1 Rewriting o(a) = o(@’) 


Following the overview presented above, our first task is to deduce from the equation 
0 (popip2°+*) = 0(qgoqig2:::) an auxiliary equation of the shape 


(po + I)(Pi + 1)-+- @e-1 + Difd = (Go + D(qi + 1)--- Ge-1 + De. (4) 
We select A—our cutoff between “large” and “small” primes—in exact parallel with 


how k is selected in [7, Sect. 5A]. In other words, we choose ko as the smallest index 
for which 


log P* (Pi, + 1) < (log, x)/2 46/9, 
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and we take k = ko unless p;, and p;,—1 are too close together in a certain technical 
sense, in which case we take k = ko—1.! As explained on [7, p. 1689], the properties 
of <% imply that 


bo Gj/2=—cAove, (5) 


(This asymptotic formula is essentially [7, Lemma 5.3]; one has only to change the 
p—1toap-+ 1 inits proof.) Moreover, if 0 < i < k, then 


log, pi > (log, x)'/?*/19 and log, qi > (logy x)'/7F6/"", 


(For this, see again [7, p. 1689].) These last estimates, along with condition (1) in 
the definition of %, guarantee that p? + a and q? + a’. Thus, the first k factors 
on the left- and right-hand sides of (4) represent the contribution to o(a) and o(a’) 
from the “large” primes po,...,Px—1 and go, ..., Gx—1, respectively. 

To make the right-hand side of (4) coincide with o(a’), it suffices to define 


€ = O(9kGk+19K+2°**)- 


The choices of f and d are slightly more delicate. If pz; 4 pr, then we put 


f = O(Pepetiss*pr-1), d= o(prpr4i°*:)- 


(In the language of the overview, L is what one thinks of as the cutoff between small 
and tiny primes.) In the general case, we let A be the largest divisor of a supported 
on the primes pz,...,pr—1, and we put f = o(A) and d = o(a/(pi---px—-1A)). 
Then (4) holds. Note that by assumption, po 4 qo. 


3.3.2 The Key Sieve Lemma 


To continue, we need a tool that allows us to count solutions to (4). We use 
the following variant of Ford and Pollack [7, Lemma 4.1], which was proved by 
repeated application of the upper bound sieve. As the required changes to the proof 
of Ford and Pollack [7, Lemma 4.1] are little more than typographical, we omit the 
demonstration. 


Lemma 3.2. Let y be large, k = 1,1>0,30<S<...< 9 = y, and uj < v; 
for 0 <j < k—1. Suppose that 1 < B < y'/!°, and put 6 = ,/log, S/ log, y. 


'Precisely: With n := 10L./log, S/ log, x, we choose k = ko unless x;.—1 (15 X) — Xk (n3 x) < 207. 
This becomes relevant for verifying that the intervals [u;, v;] selected later in the proof satisfy the 
conditions of Lemma 3.2 below. Since we will refer to [7] for the selection of u; and v; and this 
verification, it does not make sense here to go into more detail. 
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Set vj = log, v;/ logy y and fu; = log, uj/ log, y. Suppose that d is a natural number 
for which Pt (d) < vg. Moreover, suppose that both of the following hold: 


(a) For2 <j < k—1, either (uj, vj) = (uj-1. vj-1) or vy < pj-1 — 26. Also, 
Ve S fe — 26. 
(b) Forl1 <j <k—2, we have v; > vj+2. 


The number of solutions of 
(po + 1) +++ (De-1 + fd = (go + 1)+++ u-i + Ie < y/B, 


in Po, +++ >Pk—1) 90s +++> Gk-1,€,f Satisfying 
F k-1 k- 
(i) ged Tico i. T]ja9 @) = 1s 
ii) p; and q; are S-normal primes; 
2 q P 

(iii) us < PT (pi +), PT (Git 1) <vifor0<i<k-1; 
(iv) no p; + 1 or gq; + 1 is divisible by r for a prime r = vx; 

(v) Pt (ef) < uy; Q(f) < Allog, uv; 
(vi) po + 1 has a divisor > y'/? which is composed of primes > v; 


is 


y 0 - hI oy 
<K pC eek + 1)? (log 5 alle Bk+D+1 (og y) 24+] aj (+E 


where E = 6 paar logi+ i) + 2 — pj). Here c is an absolute positive 


i=l 
constant, and the a; are as defined in (3). 

Remark 3.4. The condition (i) is not present in [7, Lemma 4.1]. The explanation 
is that applying the upper bound sieve requires a linear independence condition on 
the linear forms. This condition is automatic when treating Eq. (2), because the left- 
hand shifted prime factors are shifted by —1 whereas the right-hand shifts are by 
+1. In (4), the primes are shifted by +1 on both sides, forcing us to assume (i). 


3.3.3 Capturing Solutions with Lemma 3.2: Attempt #1 


Given a pair (a, a’) € &% x WG, satisfying 
a(a)=o(a) and PT(a)#Pt(a), (6) 


we described in Sect. 3.3.1 how to construct a solution to (4). By a “solution,” we 
mean the values of po, ...,Pr—1, Go.---> Gk-1, @, e, and f. On the other hand, given 
a solution to (4) that arose this way, we can recover the common value of o(a) and 
o(a’) by computing either side of (4). So we can bound the number of exceptional 
o-values having all preimages in .% by the number of solutions to (4) that arise— 
in the manner detailed in Sect.3.3.1—from (a,a’) € % x & satisfying (6). 
Henceforth, when we speak of a “solution” to (4), we always mean a solution that 
arose this way. 
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We group solutions to (4) according to the value of k, the value of d, and the 
“rough positions” of the primes Pt (p; + 1) and P* (q; + 1). (In view of the fact that 
each pj, qj is normal, this is essentially the same as grouping by the positions of the 
pi and q; themselves, but turns out to be technically more convenient.) Our hope is 
to apply Lemma 3.2 to bound the number of solutions in each group, and then sum 
over all the groups. 

Suppose we start with a solution to (4) and want to place it in a group. What 
does it mean precisely to specify “the rough positions” of the primes Pt (p; + 1) 
and Pt (q; + 1)? We will interpret this to mean that we specify uo,...,uz—1 and 
Uo,..., Uz SO that, taking y := x, 


° 30<S<yuSuei1S...<v=y 
° uj <vjforO<j<k—-I1, 

* (a) and (b) in Lemma 3.2 hold, 

* (iii) holds. 


A systematic way of choosing u; and v; to satisfy these criteria is described in detail 
in [7, Sect. 5B].? 
Moreover, if one selects the u; and v; by that recipe, and takes 


B=1 and /=L-k, 


then our solution to (4) satisfies not only (iii) but in fact every condition of 
Lemma 3.2 except possibly condition (i). That is, P*(d) < v, and all of (ii)-(vi) 
hold. This follows mutatis mutandis from the corresponding proofs in [7, Sect. 5C]. 
The only point that merits further discussion is the verification that P* (ef) < v; (as 
claimed in (v)) and the related point that Pt (d) < v,. Here the more wild behavior 
of o on prime powers, vis-a-vis g, complicates matters. 

Let r:= P*(e). Since k < Land’ has at least L+ 1 distinct odd prime divisors, 
€ = 0(4xge+i°*:) > 1 and sor > 1. Choose a prime power R exactly dividing 
UVk+19k+2°°* for which r | o(R). If R is a proper prime power, then (1) in the 
definition of .%, implies that R < (log x)? and so 


r <0(R) < 2(logx)? < S < uy. 


So we can assume that R is a prime divisor of qxqx4i--:. Then r | R + 1, and 
r<Pt(R+ 1) < max{3, R} < gy. From the fifth display on [7, p. 1692], 


log, qx/ log, x < log, px/ logy x + (2k + 1) Vlog, S/ log, x < log, vg/ log, x. 


Thus, r < qx < v,. Anentirely similar argument shows that Pt (o(pype4i--:)) < 
vx, So that both PT (f) < vz and P*(d) < ug. 


Since we are working with solutions to (4) instead of (2), one should read [7, Sect. 5B] mentally 
replacing each expression of the form p — | with p + I. 
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If (i) were to always hold, it would be clear how we ought to finish the proof 
of Theorem 1.2. In that case, every solution to (4) would fit in a group of the sort 
counted by Lemma 3.2. Summing the estimate of the lemma over the possible k, 
d, and uj, v; (that is, over all possible groups of solutions) would give us an upper 
bound on the count of all solutions. However, there is no reason for (i) to always 
hold. It could well be that the list p;,..., pg—1 overlaps with the list g,..., qx-1. So 
we must work a bit harder before Lemma 3.2 can be applied. 


3.3.4 Attempt #2 


There is an easy fix for the problem we have just run into: Do not attempt to apply 
Lemma 3.2 until after canceling factors arising from the common large primes! 
Given a solution to (4), put 


m = gcd(po +++ Pe—-1, 90°" * Yk—-1) 


By assumption, po # qo. It follows that neither po nor qo can divide m. Indeed, 
from conditions (3) and (7) in our definition of <%, both po, go > x!/2 while each 
Pi gi < x™R* fori > 1. 

For each prime p dividing m, cancel the factors of p + 1 from both sides of (4). 
Relabeling, we obtain an equation of the form 


(Po + 1) +++ @x-1 + Ddf = (Go + D-++ Gx-1 + De (7) 


where K = k — w(m) and the common value of both sides of (7) is at most 
x/o(m). We may assume that the p; and g; are in nonincreasing order. Since 
gcd(m, pogo) = 1, we have K > 1, po = po, and Go = qo. Write each 


Pi=Pj, and gi= qj, 


where the indices i and 7’ satisfy i < j;,j; <kforO <i<K. 

In the last section, our choices of parameters in Lemma 3.2 possibly failed to 
capture the solution po,...,Px—1, Go, ---»Gk-1, 4, e, f to (4). We now describe how 
to capture the solution po,...,Px-1, Go,---,9x—1, d, e,f to (7) by making slightly 
different choices of these parameters. 

We continue to assume that u; and v; are chosen as in the preceding section. Since 
hypothesis (a) of Lemma 3.2 holds, for i > 1 the intervals [u;, vj] and [uj+1, vi+41] 
either coincide or are disjoint. Now appealing to (iiit)—which was also satisfied for 
our choices of u;, v;j—we see that 


uj, =u <PT D+ 1),P Tq +1) < 1 = vy 
for every 1 <i< K-—1. Put 


uj = uj, and v; = v;, forO<i<K—1, andput ve = vy. 
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From the second half of condition (7) in the definition of .% and the estimate 
k <L= O(log; x), 


O(m) Sm? < (ypu)? S xe! WE”, 


Hence, 


o(m) < x!/10 


for large x. 
We will apply Lemma 3.2 with 


y=x, K playing therole ofkinthelemma,  4u;,, v; in place of u;, v;, 


B=o(m), dasbefore, /as before (i.e., / = L — k for our original k). 


Since all of the previous hypotheses except (possibly) (i) held for our solution to (4), 
all of the statements in Lemma 3.2 are satisfied for our solution to (7). That is, with 
this choice of parameters, Lemma 3.2 succeeds in capturing our solution to (7). 

Now given m, one can recover the original solution to (4) from the solution to 
the canceled form (7). So to count solutions to (4), it is enough to sum the upper 
bound of the lemma over possible values of the parameters d, m (which determines 
B = o(m)), k (which determines / = L — k), K, the u;, and vj. 


3.3.5 Finishing Up 

Making analogous calculations to those on [7, pp. 1693-1694], the upper bound 
arising from a single application of Lemma 3.2 is seen to be 

Le@ 

a 


x = 1 1/2+¢€/2 
nie exp(—3 (log, x) ) 
(compare with [7, Eq. (5—13)]). It remains to sum on d, k, K, m, uj, and Vj. 

Since each of k and K is bounded by L, there are only O((log; x)”) possibilities 
for the pair (k, K). Reasoning as in [7, Eq. (5—14)], the number of possibilities for 
the i; and 0; is bounded by exp(O((log; x))). 

To handle the sum on d, we first establish the uniform bound 


Q(d) « (log, x). 


Recall that we defined d so that d = o(h) for some unitary divisor h of a supported 
on primes < pz. If h < exp((log, x)'/*), then the desired bound on 92(d) follows 
from the estimate 2(d) < log d. Otherwise, conditions (3) and (4) in the definition 
of & give us 


Q(d) < 10log, o(h) « logy h < log, py", 
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and this is also O(log, x)'/*), by the calculation in the final display of Ford and 
Pollack [7, p. 1694]. It follows that 


L° < exp(O((log, x)'/ log, x)). 


Recall that P+ (d) < vy. Our choice of vz now yields log, Pt (d) < (log, x)!/?+«/5 
[7, Eq. (5—12)]. Hence, 


1 
YF < expt(log, 9"). 


Assembling the preceding estimates, we find that the number of solutions to (4) 
corresponding to a given value of m = gcd(po... Pk—1, Yo*** Ye-1) iS at most 


x 
—— exp (—(lo ue), 

een iens p (—(log, x)'*) 

It remains finally to treat the sum over m. Since m divides p, -- + px—1, we have that 
m is squarefree and (recalling (5)) 


1 
a(m)<k< 5b < 0.9 log; x. 


Hence, 


j 
Ds ini sy (> a < exp((log; x)”), 


J! 
m J<0.9 logs x psx 


by a short calculation using Mertens’s estimate i _ p' = log,x + O(1). We 
conclude that the total number of solutions to (4) is bounded by (say) 


x 1 
a —-(] 1/2) - 
exp (= (08339 


As discussed before, this is also an upper bound on the number of exceptional o- 
values in [1,x] all of whose preimages belong to %. Since this upper bound is 
certainly o(V,(x)), the proof of Theorem 1.2 is complete. 


Remark 3.5. Recall that € > 0 can be taken arbitrarily small in the definition of 
@,. From the above argument and Proposition 3.2, it follows that the number of 
exceptional o-values in [1, x] is at most V, (x)/(log, x)!/2+°, as x > oo. 


Remark 3.6. Using the preceding remark and further ideas from [6, 7], one can 
establish the following strengthening of Theorem 1.2: For each fixed K, almost all 
o-values in [1,x] are such that all of their preimages share the same largest K + 1 
prime factors. One could even extend this to certain functions K = K(x) — oo, but 
we do not pursue this possibility here. 
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Remark 3.7, Theorem 1.2 as well as the comments in the preceding remarks all 
hold with o replaced by Euler’s g-function, by essentially the same proofs. 


4 Proof of Corollary 1.1 


Proof. Suppose that v < x is the common o-value of some amicable tuple. Then 
there are n,,...,m, with = n; = v and each o(n;) = v. Clearly, each n; < x. 
By Theorem 1.2, we can assume that all the n; have the same largest prime factor P. 
We can also assume that P > z, where z := x!/(4l08!°8) | Otherwise, a crude upper 
bound on the count of smooth numbers (such as [17, Theorem 1, p. 359]) shows 
that each n; is restricted to a set of size < x/(logx)? = o(V,(x)), which would 
mean that v = o(n;) is also so restricted. For the remaining values of v, observe 
that P divides xy nj = v = a(n). Write n; = Pm. If P | m, then n, is divisible 
by the square of a prime exceeding z, leaving < eee p? & x/z = o(V,(x)) 
possibilities for m,. So assume that P + m. Since P divides o(n,) = (P + l)o(m) 
and P = P*(n,), there must be a proper prime power R dividing m for which 
P | a(R). Since P divides o(R) and o(R) < 2R, we have that R > P/2 > z/2. 
Thus, 1; possesses a squarefull divisor exceeding z/2, which restricts n;—and also 
v = o(n,)—to a set of size < x/,/z = o(V,(x)). 


Acknowledgements The author thanks Kevin Ford for guiding him through the arguments of 
Ford [6]. He also thanks the referee for useful feedback that led to improvements in the exposition. 
Much of this work was conducted during the Fall 2009 semester, while the author was visiting 
the Institute for Advanced Study. He thanks the IAS for providing an ideal working environment. 
While at the IAS, the author was supported by National Science Foundation award DMS-0802970; 
currently, he is supported by NSF DMS-1402268. 


References 


1. L.E. Dickson, Amicable number triples. Am. Math. Mon. 20, 84-92 (1913) 

2. L.E. Dickson, History of the Theory of Numbers. Volume I: Divisibility and Primality (Chelsea 
Publishing Co., New York, 1966) 

3. P. Erdés, G.J. Rieger, Ein Nachtrag tiber befreundete Zahlen. J. Reine Angew. Math. 273, 220 
(1975) 

4. P. Erdés, On amicable numbers. Publ. Math. Debrecen 4, 108-111 (1955) 

5. P. Erdés, Remarks on number theory. Il. Some problems on the o function. Acta Arith. 5, 
171-177 (1959) 

6. K. Ford, The distribution of totients. Ramanujan J. 2, 67-151 (1998). Revised version available 
as arXiv:1104.3264 [math.NT] 

7. K. Ford, P. Pollack, On common values of y(n) and o(m), II. Algebra Number Theory 6, 
1669-1696 (2012) 

8. H. Maier, C. Pomerance, On the number of distinct values of Euler’s g-function. Acta Arith. 
49, 263-275 (1988) 


320 P. Pollack 


9. A. Makowski, On some equations involving functions y(n) and o(n). Am. Math. Mon. 67, 

668-670 (1960) 

10. T.E. Mason, On amicable numbers and their generalizations. Am. Math. Mon. 28, 195-200 
(1921) 

11. J. Maynard, Dense clusters of primes in subsets. Preprint online as arXiv: 1405.2593 [math.NT] 
(2014) 

12. C. Pomerance, On the distribution of amicable numbers. J. Reine Angew. Math. 293/294, 
217-222 (1977) 

13. C. Pomerance, On the distribution of amicable numbers. II. J. Reine Angew. Math. 325, 
183-188 (1981) 

14. C. Pomerance, On amicable numbers (2014). To appear in a Springer volume in honor of H. 
Maier 

15. P. Poulet, La chasse aux nombres. I: Parfaits, amiables et extensions (Stevens, Bruxelles, 1929) 

16. A. Schinzel, W. Sierpiriski, Sur certaines hypothéses concernant les nombres premiers. Acta 
Arith. 4, 185-208 (1958). Erratum 5, 259 (1959) 

17. G. Tenenbaum, Introduction to Analytic and Probabilistic Number Theory. Cambridge Studies 
in Advanced Mathematics, vol. 46 (Cambridge University Press, Cambridge, 1995) 

18. Y. Zhang, Bounded gaps between primes. Ann. Math. 179, 1121-1174 (2014) 


On Amicable Numbers 


Carl Pomerance 


To Professor Helmut Maier on his 60th birthday 


Abstract Let <(x) denote the set of integers n < x that belong to an amicable pair. 
We show that #.o/ (x) < x/ev!°S* for all sufficiently large x. 


Keywords Amicable number 


Mathematics Subject Classification: 11A25, 11N25 


1 Introduction 


Let o denote the sum-of-divisors function and let s(n) = o(n) — n. Two different 
positive integers a, b with s(a) = b and s(b) = a are said to form an amicable pair. 
This concept is attributed to Pythagoras and has been studied over the millennia 
since both by numerologists and number theorists. The first example of an amicable 
pair is 220 and 284. About 12 million pairs are now known, but we don’t have a 
proof of their infinitude. 

Say a positive integer is amicable if it belongs to an amicable pair and let / 
denote the set of amicable numbers. Kanold [9] was the first to consider </ from 
a statistical viewpoint, announcing in 1954 that </ has upper density smaller than 
0.204. Soon after, Erdés [5] showed they have asymptotic density 0. In the period 
1973-1981 there were several papers getting successively better upper bounds for 
#9 (x), where &(x) = & NM [1,x]. Somewhat simplifying the expressions, these 
upper bounds have progressed as follows: 


Xx Xx Xx XxX 


(log log loglogx)'/?" logloglogx’ exp ((log loglog.x)'/2)’ exp ((log x)!/3) ’ 
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see [7, 14-16], respectively. In this paper we are able to replace the exponent 1/3 in 
the last estimate with 1/2. In particular, we prove the following theorem. 


Theorem 1.1. As x — ov, we have 


#.f (x) < x/ exp ((4 + o(1)) log xlog log Tog x) . 


The proof largely follows the plan in [15], but with some new elements. In particular, 
a separate argument now handles the case when an amicable number n is divisible 
by a very large prime p. Thus, assuming the largest prime factor p of n is not so large, 
it can be shown that, usually, a fairly large prime divides o (n/p). The argument to 
handle the case of p large is reminiscent of the paper [13] which deals with Lehmer’s 
problem on Euler’s function g, also see the newer paper [1]. In addition, we use a 
result in [2] to streamline the argument that o(n/p) has a large prime factor. 

The argument in [15] was subsequently used to estimate the distribution of 
numbers n with g(n) = y(n + 1), and some similar equations; see [6, 8]. These 
results were since improved in [17]. However, it is not clear if the method of Yamada 
[17] can be used for the distribution of amicable numbers. 

We record some of the notation used. Let P(n) denote the largest prime factor of 
n > 1 and let P(1) = 1. We say an integer n is squarefull if for each prime p | n 
we have p” | n. We use the notation (a, b) for the greatest common divisor of the 
positive integers a, b. We write d || nif d | n and (d,n/d) = 1. 

Note that if n is large we have s(n) < 2n log logn. 


2 A Lemma 


Let &(x, y) denote the number of integers n € [1,x] with P(g(n)) < y. In [2, 
Theorem 3.1] it is shown that for any fixed € > 0, we have (x, y) < xexp(—( + 
o(1))ulog log u) as u + oo, where u = logx/ logy and (loglogx)'** < y < x. 
At first glance one might think there is a typographical error here, since the 
corresponding result of de Bruijn [4] counts n € [1,x] with P(n) < y, and the 
upper bound is xexp(—(1 + o0(1))ulog u), which is supported by a corresponding 
lower bound, see [3]. However, a heuristic argument indicates that it is likely that 
for ®(x, y), replacing log u with log log u is correct (see [2, Sect. 8] and [10]). Until 
we have a corresponding lower bound, we will not know for sure. 

In this paper we will require an estimate for the number of n € [1,x] with 
P(o(n)) < y. The function o() closely resembles the function g(1); we see “p+ 1” 
in o(n), where we would see “p — 1” in y(n). However, it is not as simple as this, 
since o treats higher powers of primes differently than ¢. It is possible to overcome 
this difference since most numbers n are not divisible by a large squarefull number. 
But to keep things simple, we restrict our numbers 7 in the following result to 
squarefree numbers. Let 5’(x, y) denote the number of squarefree numbers n € [1, x] 
with P(a(n)) < y. 
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Lemma 2.1. For each fixed € > 0, we have S(x,y) < xexp(—(1+0(1))u log log u) 
as u — 00, where u = log x/ logy and (log logx)'** < y <x. 


As indicated above, the proof of this result follows from small cosmetic changes to 
the proof of the corresponding result on ®(x, y) in [2]. 


3 Proof of Theorem 1.1 


Let x be large and let 
L= L(x) = exp (5 log x log log log x) . 


(i) Forn € &(x), we may assume that n > x/L and s(n) > x/L. 


The first assertion is obvious and the second follows from the fact that each 
n € &@(x) is determined by s(n). 


(ii) Forn € (x) we may assume that the largest divisor d of n with P(d) < L? has 
d < x'/?, and similarly for s(n). In particular, we may assume that P(n) > L? 
and P(s(n)) > L?. 


Indeed, by [4], for z => x!/3, the number of integers d < z with P(d) < Lv’ 
is O(z/L), so by partial summation, the number of n € (x) divisible by such a 
number d > x!/? is O(x(log x)/L). A parallel argument holds for s(n). The assertions 
about P(n), P(s(n)) now follow from (i). 


(iii) Forn € A(x) we may assume the largest squarefull divisor of n is at most L’, 
and the same for s(n). 


Since the number of squarefull numbers d < z is O(,/z) for all z > 1, partial 
summation implies that the number of integers n < x divisible by a squarefull 
number d > L” is O(x/L). A similar estimate holds for s(n). 

Note that ifn € .o/(x), then (ii) and (iii) imply that P(x) || n and P(s(n)) || s(n). 
For the remainder of the proof, for n € (x) we write: 


n=pm, p=P(n)}m, s(n)=n! =p'm, p' = Pir’) tm. 


Note too that for x large we have n’ < 2x log log x. 
(iv) Forn € &(x), we may assume that P((n, s(n))) < L. 


Suppose n € (x), r = P((n,s(n))), and that r > L. Then r | a(n), so there is 
a prime power q/ | n with r | o(q/). We use an elementary inequality found in the 
proofs of [11, Lemma 3.6], [12, Lemma 3.3]: for any positive integer d, 
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1 (log x)” 
qi d 


(1) 


dsx 
djo(q) 


We Apply (1) with d = r, getting that the number of n < x with r | n and r | o(n) 
is O(x(log x)?/r’), and so the number of n which violate (iv) is at most a constant 
times 


x(log x)” 


1 
x(log x)? » 72 << L 


r>L 
This estimate shows that we may assume (iv). 
(v) Forn € &(x), we may assume that mm! > x/L. 


Suppose n € .o/ (x). We have 
pim! = s(n) = o(pm) — pm = ps(m) + o(m), (2) 
p'o(m') + o(m’) = o(p'm’) = o(n) = po(m) + o(m). (3) 
Multiplying (2) by o(m), (3) by s(m) and subtracting to eliminate p, we have 
p'm'o(m) — p'o(m')s(m) — o(m')s(m) = o(m)? — o(m)s(m) = mo(m). 
Thus, 
p (m'o(m) — o(m')s(m)) = o(m')s(m) + mo(m). (4) 


Since the right side of (4) is positive, we see that m,m’ determine p’, and by 
symmetry, they also determine p. So the number of cases for which (v) fails is at 
most 


Y> 1 = O(x(logx)/L). 


mnt <x/L 


(vi) Forn € &(x), we may assume that p, p! < x°/+L. 


Suppose n € (x) and p > x°/4L. Then m < x!/*/L, so by (v), m' > x°/*. Since 
n’ < 2xloglog.x, we have p’ < 2x!/4loglog x. Write the prime factorization of n/ 
as P\p2...P;, Where p; = p’ and py > p2 >... > p;, and forj = 1,2,...,1, let 
D; = p\p2...p;. By G), some Dj > x'/?; let D be the least such divisor of n’. Since 
p, < 2x!/4 log log x, we have 


x/2 <D < 23/4 log log x. (5) 
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Further, by (ii) and (iii), D is squarefree, D || n’, and every prime dividing D is larger 
than L?. From the identity 


s(m)o(n') = s(m)o(n) = o(m)o(n) — o(n)m = a(m)o(n) — o(m)(p + 1)m 
= a(m)a(n) — o(m)(n + m) = o(m)n' — o(m)m, 
we have 
s(m)o(D)o(M) = o(m)DM — mo(m). 
Reading this equation as a congruence modulo o(D), we have 
o(m)DM = mo(m) (mod o(D)). 


The number of choices for M < 2x(log log x)/D which satisfy this congruence is at 
most 


2x log log x 24 2xo0 (m)(D, o(D)) log log x 


' + D5(D)/(e(mD.o(D) ~ D? 


Since (D,o(D)) | (n,n’) and every prime dividing D exceeds L* > L, (iv) implies 
that (D,o(D)) = 1. So, for a given choice of m, D, the number of choices for M is 
at most 


2 

= entrees) 
We sum this expression for D satisfying (5) and m < x'/+/L and so get that the 
number of choices for n € (x) with p > x°/4L is O(x(loglogx)/L). A similar 
argument holds if p’ > x°/4L. We conclude that the number of cases where (vi) fails 
is negligible. 

For n = pm € &(x) we write m = mom, where m, is the largest squarefree 
number with m, || m, and we similarly write m! = mpm’. 


(vii) Forn € &(x), we may assume that P(a(m,)) > Land P(a(m})) > L. 
Assume for 1 € (x) that the first condition in (vii) fails. (The argument for the 

second condition will follow similarly.) By (iii) and (vi), pm < 3/473. For given 

choices of p and mp we count the number choices of squarefree integers m, < x/pmpo 


with P(o(m,)) < L. For this, we use Lemma 2.1. Let u = log(x/pmo)/ log L. Since 
pmo < x°/4L3, we have 


log (x!/4/L3 lo 
— g ( / ) = (1 +0(1)) gx 
logL log log log x 
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so that 


uloglogu = (3 + o(1)) log x log log log x = (1 + o(1)) logL 
as x —> co. Hence, we uniformly have that the number of choices for mm is at most 


Xx 
pnigL' +0) 


as x — oo. We now sum on p, mp getting that the number of n € (x) for which 
P(a(m,)) < Lis at most x/L't° as x > oo. 

We now turn to the conclusion of the argument. We suppose that n € (x) 
and that (i)-(vii) hold. At the cost of doubling our count and letting n run up to 
2x log log x, we may assume that p > p’. (That p # p’ can be seen from (ii), (iv).) 
By (vii), there is a prime r | o(m,) with r > L. Thus, there is a prime g | m with 
q =—1 (mod r). This implies that g > L, so that by (iv), q +n’. But a(n) = a(r’), 
so there is a prime power & | n’ with £ # q andr | o(l). Note that, by (1), 
> 1/ « (log x)?/r. We have 


n’ = s(n) = ps(m) + o(m) = 0 (mod &’/). (6) 


Say ¢' = (€/, s(m)), so that p is in a residue class a(m) (mod &~). Also, (6) implies 
that £' | o(m), so €' | m. Further, using (ii), (iii), and p > £, we may assume that 
p > U. The number of such numbers n < 2x log log x is at most 


ie ae ee ee ee 


r>L j MS<X — pX2x log log x/m 
q= Hind r) Nod) qt'|m p =a(m)(mod &/—*) 
p> 


2x log log log x log log 
SUL LL re « yy) 
i r q bi i 
& 3 sy y logs ee logx & 3 3 “ees Tes logx 
rog @ rooq 


x(logx) loglogx | x(log x) log log x 
ae r « 7 


’ 


r 


where we treated r,q,€,p as integer variables. This calculation along with the 
previous cases finishes the proof. 
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Trigonometric Representations of Generalized 
Dedekind and Hardy Sums via the Discrete 
Fourier Transform 
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To Professor Helmut Maier on his 60th birthday 


Abstract We introduce some new higher dimensional generalizations of 


the 


Dedekind sums associated with the Bernoulli functions and of those Hardy sums 
which are defined by the sawtooth function. We generalize a variant of Parseval’s 
formula for the discrete Fourier transform to derive finite trigonometric represen- 
tations for these sums in a simple unified manner. We also consider a related sum 


involving the Hurwitz zeta function. 


Keywords Dedekind sums ¢ Hardy sums ¢ Bernoulli polynomials and functions 


¢ Hurwitz zeta function * Discrete Fourier transform 
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1 Introduction 


The classical Dedekind sum is defined for h € Z, k € N := {1,2,...} by 


amry= © ((9)((#)). 


a (mod k) 


adopting the usual notation 
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{x} — 5. if x ZZ, 


= ie if x Z, 


where {x} := x— |x| stands for the fractional part of x (cf. [9, 13]). If ged(h, k) = 1, 
then s(h, k) can be represented as 


s(h, k) = zLon(% “) co (=) (1) 


and 


sth) = 5 - cot (7). (2) 


r=1 
r¥0 (mod k) 


The identities (1) and (2) were derived in 1933 by Rademacher [12] in order to 
obtain a simple direct proof of the reciprocity formula for the Dedekind sums. See 
also [13, pp. 18-25]. According to [1, p. 347], (1) was obtained earlier, in 1923 
by Mellin. The identity (1) is also the starting point for various generalizations of 
s(h, k). See, e.g., the papers of Beck [2], Dieter [8], Zagier [16]. 

It is known that (1) is a direct consequence of a variant of Parseval’s formula 
for the discrete Fourier transform (DFT). See the paper of Almkvist [1, Sect. 6] and 
the book by Beck and Robins [4, Chap. 7]. More specifically, consider a function 
f :Z-— C, which is k-periodic (periodic with period k), where k € N. We define 
the DFT of f as the function f = F(f), given by 


fn) := = f@er"* (n € Z). 


a (mod k) 


Furthermore, if f; and /2 are k-periodic functions, then their inner product is 


(i.h) = Yo f@h@. 


a (mod k) 
having the property 
1 ~~ 
(it) = ph), 


or equivalently, 


Y Aonca=— Y hah. 3) 


a (mod k) a (mod k) 
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Now, (1) follows by applying (3) to the functions 


nw =((%)). no = ((#)) 


and using the fact that the DFT of the sawtooth function is essentially the cotangent 
function. 

It is the aim of this paper to exploit this idea in order to deduce similar finite 
trigonometric representations for certain new generalized Dedekind and Hardy 
sums, in a simple unified manner. Our results are direct applications of a higher 
dimensional version of the identity (3), included in Theorem 2.1. We derive in 
this way Zagier-type identities for new higher dimensional generalizations of the 
Dedekind sums associated with the Bernoulli functions and of those Hardy sums 
which are defined by the sawtooth function. Note that all finite trigonometric 
representations we obtain contain only the tangent and cotangent functions, and are 
special cases of the Dedekind cotangent sums investigated by Beck [2]. Therefore 
the reciprocity law proved in [2, Theorem 2] can be applied for each sum. 

Furthermore, we consider a related sum, studied by Mikolas [11], involving 
the Hurwitz zeta function. We remark that (3) was used to evaluate some finite 
trigonometric and character sums, but not Dedekind and related sums, by Beck 
and Halloran [3]. We point out that the identity (2) can be obtained from (1) using 
another general result (Lemma 4.2 in Sect. 4). 


2 Properties of the DFT 


We will apply the following general result. 


Theorem 2.1. Let fi,...,fm be arbitrary k-periodic functions and let h € Z, 
gcd(hj,k) = 1 (1 <j < m), where m,k € N. Then 


1 A -_ 
ilar) +-Fn(Qmbin) =~ DY Alay) ---fnlaly,), 
\,...,4m (mod k) a(mod k) 
a, +...+an=0 (mod k) 


Il 
= 


where h;, is the multiplicative inverse of h; (modk), that is hjh; 
(mod k) with 1 <j < m. 
Proof. We only need some simple well-known facts concerning the DFT. See, for 


instance, [15, Chap. 2], [4, Chap. 7]. The Cauchy convolution of the k-periodic 
functions f; and f, is defined by 


A®AM= Yo fi@h@)= Yo A@p—-a) (ne Z), 


a ,az (mod k) a (mod k) 
a, +a2=n (mod k) 
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which is associative and commutative. Also, 
A®@h=fih- 
More generally, if fi, ...,fm are k-periodic functions, then 
F (fi @+++ @fm) = F(fi)-++ Fm): 
Recalling that 
F(F (f)\(n) = kf(-n) (n€ Z), 
which is valid for every k-periodic f, this yields 


K(f Qe: @ fin) (—n) = F (F (fi) --- F Fn) )(n), 


that is 
1 mn ee 
fila) +-Finlan) = YD AO) --fn(ave Pr" (n € 2). 
),...,dm (mod k) a(mod k) 
a\+...+am=—n (mod k) 


For n = 0 we obtain 


1 ~ Zs 
ye fila) -+-fmlQm) = = DY A@-+-fnl). (4) 
\,...,4m (mod k) a (mod k) 
a, +...+an=0 (mod k) 


Now the result follows from (4) by showing the following property: Let f be a 
k-periodic function and let h € Z such that gcd(h,k) = 1. Then the DFT of the 
function g defined by g(n) = f(nh) (n € Z) is &(n) = f(nh’) (n € Z), where hr’ is 
the multiplicative inverse of h (mod k). 

Indeed, 


&(n) = x. s(aje rr" _ a f(ahye 2miahhn/k 


a(mod k) a (mod k) 


and since gcd(h, k) = 1, if a runs through a complete system of residues (mod k), 
then so does b = ah. Therefore, 


(n) = > fib es = f(nh’). 


b (mod k) 


Generalized Dedekind and Hardy Sums 333 


Corollary 2.1. Let f, and f. be k-periodic functions (k € N) and let h,,hy € Z, 
gced(h1,k) = gced(h2,k) = 1. Then 


Y Alahyplats) =~ > fiahayfitan). (5) 


a(mod k) a(mod k) 
Proof. Apply Theorem 2.1 form = 2. We deduce that 
1 ~ pa 
de filahifa(—aln) = DT filahfa(ans) 


a(mod k) a (mod k) 


II 


1 — — 
i CALS ACAD) 


a (mod k) 


II 


YS fempon), 


b (mod k) 


by using the fact that if a runs through a complete system of residues (mod k), then 
so does b = ah{h, since ged(hyhz, k) = 1. This gives (5) by setting hy := —hn. O 


For hy = 1, ho = —1 from (5) we derive (3). 


Corollary 2.2. Let f\ and fy be k-periodic functions (k € N) and assume that f, or 
Jo is odd (resp. even). Let hy, hy € Z, gced(hy, k) = ged(h2, k) = 1. Then 


Y Alain yp(ats) = — > Fran fran), 6) 
a(mod k) a(mod k) 


where s = 1 if f, or fo is odd, s = O if f, or fa is even. 


Note that if the function f is odd (resp. even), then i: is also odd (resp. even). If 
one of the functions f, 2 is odd and the other one is even, then both sides of (6) are 
zero. 

In this paper we will use the following DFT pairs of k-periodic functions. 


Lemma 2.1. (i) Letk € N. The DFT of the k-periodic odd function f(n) = ((2)) 
(né€ Z) is 


a Scot(#), ifktn, 
I) 0, if k | n. 


(ii) Letk € N and let B, (r € N) be the Bernoulli functions (cf. Sect. 3.1). The DFT 
of the k-periodic function f (n) = B, (2) (n€ Z) is 
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rk (2) cot") (#4), if kk tn, 


f(n) = 
B,k'~, if k|n, 


where B, is the r-th Bernoulli number and cot is the m-th derivative of the 


cotangent function. 
(iii) Letk € N be even. The DFT of the k-periodic odd function f (n) = (—1)" ((7)) 
(n€ Z) is 
m —L tan (22 k k 
A = 5 tan (3"), ed a )s 
0, if n = 5 (mod k). 


(iv) Let k be odd and let n (mod k) = n—k\n/k| be the least nonnegative residue 
of n (mod k). The DFT of the k-periodic odd function 


(—1)" (mod k) if k f n, 


fo = 4 ae 


fn) = itan (=) (n€Z). 


(v) Let k € N. Let F(s,x), €(s,x), and f(s) be the periodic zeta function, 
the Hurwitz zeta function, and the Riemann zeta function, respectively (cf. 
Sect. 3.3). For Ns > 1 the DFT of the k periodic function f(n) = F(s, ¢ 
(né Z) is 


joy aE ROAR). etn 
k!-8C(s), if k | n. 
Here (i) and (iv) are well known. They follow, together with (iii) and (v), by easy 


computations from the definition of the DFT. For (ii) we refer to [2, Lemma 6]. See 
also [1, Sect. 6]. 


3 Applications 


3.1 Generalized Dedekind Sums 


We first derive the following higher dimensional generalization of the identity (1), 
first deduced by Zagier [16, Th. p. 157], in a slightly different form, by applying 
some other arguments. 
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Theorem 3.2. Let k € N, m € N be even and let h € Z, ged(hj,k) = 1 


(1 <j < m). Then 
> ((“")) _ ((“#")) 
\,...,4m (mod k) : ‘ 


ay +...+d,=0 (mod k) 


2 kl 


(-1)”/ 3 (=) (=) 
= -—— cot | —— ]}---cot{ — }. (7) 
i k k 
Proof. Apply Theorem 2.1 for fi = ... = fin = f, where f(n) = ((2)) (n € Z). 
Use Lemma 2.1/(i). oO 


Note that if m is odd, then both sides of (7) are zero. 


Corollary 3.3. Assume that m = 2. Letk € N, hi,kng € Z, ged(hy,k) = 
gced(h2,k) = 1. Then 


L(E)(@)-ale(P CF) © 


a=1 a=1 


Identity (8) is the homogeneous version of (1) and is equivalent to (1). 
Now consider the Bernoulli polynomials B,(x) (r => 0), defined by 


tex! _ S BQ) 
~ r! : 


e'—] = 
Here 
B(x) = x—1/2, Bo(x) = x? —x+1/6, B3(x) = x° —3x°/24+x/2 and B, := B,(0) 
are the Bernoulli numbers. The Bernoulli functions x +> B,(x) are given by 
B(x) = B,({x}) (xe R). 
Note that 
B(x) = ((x)) for x ¢ Z, 
but 


By (x) = —1/2 4 0 = (a) forx € Z. 
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For r},...,%m € N,A1,...,4m € Z we define the higher dimensional Dedekind— 
Bernoulli sum by 


— fayh = Anh 
Srp secatm My 0+ 5 mk) = a By, () + By, ( k ) : (9) 


d\,...;4m (mod k) 
ay +...+a,=0 (mod k) 


In the case when m = 2 and by setting hz := —hz we obtain the sum 
ie — f{ah,\— [ah 
Sry (hy, ho; k) = ZF (*) B,, (2) (10) 


first investigated by Carlitz [7] and Mikolas [11]. See the paper of Beck [2] for 
further historical remarks. 


Theorem 3.3. Let k,m,r; € N be such that A := rj +... + 1m is even and h; € Z, 
gced(hj,k) = 1(1 <j < m). Then 


B,, -B,,, 
Sry sich Tn (Ay, ’ hn} k) = [A-m+1 
k-1 
(—1)4/217, «++ hn 1 (mahi 1 (mahi, 
+ Dt cottr (Fat) cote (Btn) ay 


Note that if A is odd, then the sum in (11) vanishes. If A is odd and there is at 
least one j such that r; > 3, then B,, = 0 and the sum (9) vanishes as well. 


Proof. Apply Theorem 2.1 and Lemma 2.1/(ii) to the functions 


fin) =B, (7) A sjism). 


oO 


Corollary 3.4 ((2, Corollary 7]). Let k,r},r2 € N, hy, hz € Z be such that r; + ro 
is even and gcd(h, k) = gced(h2, k) = 1. Then 


— fah;\— [ah 
a ee es 
a(mod k) 
k-1 


B,,B,, (-1) 7/27 rp (4-1 mah, -] mah, 
— 2 4 cot!) [| ——* ) cot’27) | —- 3 |). 
es : 


~ ritr2-i Qnitnkritnl k 


a=1 
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3.2 Generalized Hardy Sums 


The Hardy sums (known also as Hardy—Berndt sums) are defined for h € Z,k ¢ N 
as follows: 


S(h, k) = > ene. 


a (mod k) 
s(hki= SY) (-plel ((¢)). 
a (mod k) 
son= ¥ cor(D)((F) 
a (mod k) 
93(h, k) = Fi (-1)* (()) 
a (mod k) 
sq(h, k) = > (—1) lane 
a (mod k) 
55(h, k) := ae ae ((<)) . 


Berndt and Goldberg [6] derived finite and infinite series representations for the 
above sums. These identities were also obtained later by Sitaramachandrarao [14] 
by using some different arguments. One could see [2, 6, 8, 14] for the history of 
these sums as well as for further results on the Hardy sums, including reciprocity 
formulas. 

We define the following generalization of s2(h, k): 


hy Anh 
Ahi, «lm: k) = (-1)" ((“)) a ((*)). 
a 2 k k 


lisiees 
a\+...+am=0 (mod k) 


Theorem 3.4. Let k,m ¢ N be even, h,...,hm € Z, hy odd, gcd(hj,k) = 1 
(1 <j < m). Then 


—| m/2-1 hi hy! hy! 
A(h,..., Amik) = ——— > tan (7 +) cot (74 2). -cot (7), 


l<a<k—-1 


a#k/2 


Proof. Let f\(n) = (—1)" ((2)) and f(n) = ((2)) (2 <j < m). Apply Theorem 2.1 
and Lemma 2.1/(i),(iv). oO 
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Corollary 3.5. Assume that m = 2. Let k € N be even, hy,hy € Z, h, odd, and 
gcd(h,k) = ged(h2,k) = 1. Then 


SG) 2g) 


a#k/2 


In the case m = 2, hy = 1, hy = h we obtain the following corollary, cf. [6, 
Eq. (14)], [14, Eq. (7.3)]. 


Corollary 3.6. [fk € N is even, h € Z, gcd(h,k) = 1, then 


so(h, k) = -Z » tan (=) cot (=*) ‘ 


l<a<k-1 


a#k/2 


Next, we define the following common generalization of s)(h,k), s3(h,k) and 
55(h, k), as follows. 


h mlm 
B(h,.... lm: k) — 1m (—1)0 telah /k] (())--(()) ; 
Am (mod k) 


GY seae5 
a, 40 (mod k) 
ai +...+am=0 (mod k) 


Theorem 3.5. Let k € N be odd, m € N be even, hy € Z, gcd(hj,k) = 1 
(1 <j < m). Then 


KA 
; 177 mahi man, man’, 
Belts. BY = “Geena tan cot ; cot ae 


Proof. Apply Theorem 2.1 to the following functions: 


(17m), if k tn, 


ne =} if k|n, 


fin) =((Z)) @sism 


and also Lemma 2.1/(iv). oO 


Corollary 3.7. Assume that m = 2. Let k € N be odd, hi,hy € Z, h, odd, 
gcd(h,,k) = gcd(ho,k) = 1. Then 


k-1 


k-1 
h 1 tah ah 
= a+ ah, /k] an a t 2 t 1 ; 
CY k Ik 2 sc iy A aa 


a=1 
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For m = 2 in the special cases hy = 1, hy = hand h, =h, hy = 1, respectively, 
we obtain the identities, cf. [6, Eqs. (15), (17)], [14, Eqs. (7.4), (7.6)], as follows 
Corollary 3.8. [fk € N is odd, h € Z, gcd(h, k) = 1, then 


k-1 
1 grah wa 
s(h.k) = = J tan (*) cot (=). (12) 
a=1 


Ifk € N is odd, h € Z is odd, gcd(h, k) = 1, then 


i= wa mah 
55(h, k) = ak ) tan (=) cot (*"*) e 
a=1 


k 


Corollary 3.9. Assume that m = 
gcd(hy,k) = ged(h2,k) = 1. Then 


Ss ()) = 2 San (*0" oa ("0"), 


2. Letk € N be odd, hi,ho € Z, h, even 


k 


For m = 2 in the special case hy = 


h even, hy = 
Eq. (13)], [14, Eq. (7.2)], as follows. 


1 we obtain the identity, cf. [6, 


Corollary 3.10. [fk € N is odd, h € Z is even, gcd(h, k) = 1, then 


k 


j= qa mah 
si(h,k) = OE y tan (=) cot (7) 
a=1 


(13) 


Note that the Hardy sums S(h, k) and s4(h, k) can also be treated with the DFT in 
the case when k is odd. For example, applying Corollary 2.2 to the functions 


Gey mt if k En, 
At =A = i if kIn 


we obtain the following representation. 


Corollary 3.11. [fk € N is odd, hy, hy € Z, gcd(hy,k) = ged(hz, k) = 1, then 


k-1 

h 
yep (mod k) 1 tan (7 “) ta 1 *), (14) 
a=l a=1 


If hy = 1 and h, = h is odd, then the left-hand side of (14) is exactly s4(h, k). 
See [6, Eq. (16)], [14, Eq. (7.5)]. If hy = hz = 1, then (14) provides the following 
classical identity, valid for k € N odd, cf. [3, Proposition 3.1] 
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Remark 3.1. For k odd, h even, the formula [6, Eq. (13)] receives the following 
representation: 


k 


Ga 2j—1 
si(h,k) = 7 be cot (=) cot (=) , 


j=l 
JA(k+1)/2 


which can easily be transformed into 


(k-1)/2 


s\(h,k) = : i> tan (~) cot (7) 


j=l 


that is equal to the right-hand side of (13). Similar considerations are valid for the 
corresponding formulas on the Hardy sums s4(h, k), s5(h, k), and S(h, k). 


Remark 3.2. The finite sum identities (7.1)-(7.3), (7.5), and (7.6) from the paper 
[14] contain some misprints. Namely, in formulas (7.1) and (7.5) the sum ee, 
should be }7\_,, while in (7.2) and (7.6) the sum °\"\ ,24q41/2 should be 


r=1? 
ae +1)/2» the missing terms being nonzero. Furthermore, in formula (7.3) the 
sum y should be ins the term for r = k/2 (namely tan(z/2)) being 
not defined. 


One could possibly investigate some further higher dimensional generalizations 
and analogues of the Hardy sums involving the Bernoulli functions, however we do 
not discuss this in the present paper. 


3.3 Sums Involving the Hurwitz Zeta Function 


Theorem 2.1 and its corollaries can be applied in several other situations as well. 
For example, let 


= 1 
C(s,x) := Ga 


be the Hurwitz zeta function, where 0 < x < 1 and ¢(s,1) = ¢(s) is the Riemann 
zeta function. The function 
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v= Bs (f 24) (o 48) 


investigated by Mikolas [11], is an analogue of the Dedekind sum (10), taking into 
account that 


B,(x) = —nG(1—n,x) (neN, O0<x<1). 


Let 


00 7, 2minx 


F(s,x):= > 


n=1 


(x € R) (15) 


be the periodic zeta function, which converges for is > O if x ¢ Z and for Xs > 1 
ifx eZ. 
Applying Corollary 2.1 to the functions 


fn) = F (5, ) (n€Z, 1<j<2), 
we deduce by Lemma 2.1/(v) the next new result. 


Theorem 3.6. Let k € N, hy,ho € Z, ged(h), k) = ged(hn,k) = 1 and let 51,52 € 
C, Ns, Rs2 > 1. Then 


Lo | 
D(h,k) = ("1 — 1)E (61) (62) + AP! F (5. a F(s-) 
a=1 


4 Some Further Remarks 


The following simple and useful result can be applied to obtain infinite series 
representations for the Dedekind and Hardy sums. 


Lemma 4.2. [ff :N — C is ak-periodic (k € N) odd function, then 
fr) 2 = mr 
S(f:= > =F Ls cot (=) (16) 


. k-1 
=-5 Fo. (17) 
r=1 
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For the Dedekind sum, (2) is a direct consequence of identities (16) and (1). As 
another example, (16) and (12) imply that for k odd, gcd(h, k) = 1, one has 


[o) 


1 1 mrh 
h,k) = t j 
53(h, k) ie an( i ) 


r=1 


One could see [6, Theorem 1], [14, Theorem 7.1] for the above formula as well 
as for similar representations regarding Hardy sums. 

Identity (16) of Lemma 4.2 was proved in [14, Lemma 2.1] by applying results of 
Lehmer [10] on the generalized Euler constants y(r, k) associated with the infinite 
arithmetic progression r,r + k,r+ 2k,...(1 < r < k), where 


1 1 
k):= I --—-|] 
y(r,b = lim | 7 —— Flogx 
l<n<x 
n=r (mod k) 


Berndt [5] deduced (17) by contour integration (with a different definition of 
the DFT). The fact that the finite sums (16) and (17) are equal provides another 
simple consequence of Corollary 2.2, applied to the odd functions f : N — C and 


n> ((Z)): 


Furthermore, according to [10, Theorem 8], if f is a k-periodic function, then 


k 
Sf) = ofOvC.#. 


r=1 


provided that eer f(k) = 0, representing a necessary and sufficient condition for 
the convergence of the series S(f) (which holds if f is a k-periodic and odd function). 
We note that the DFT of the k-periodic function rb y(r,k) is 


F(i,-!), if kty, 
y, if k|r 


(cf. [10, p. 127]), where F(s,x) is the periodic zeta function defined by (15) and 
y := y(0, 1) is Euler’s constant. Therefore, we deduce by Corollary 2.1 the next 
identity. 


Corollary 4.12. [ff : N — C is ak-periodic (k € N) odd function, then 


S(f) = + SWF (1, -=) 
r=1 
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On Arithmetic Properties of Products 
and Shifted Products 


Joél Rivat and Andras Sarkézy 


Dedicated to Helmut Maier on the occasion of his 60th birthday 


Abstract Let & and & be large subsets of {1...,N}. Arithmetic properties of the 
products ab, resp. of the shifted products ab + 1 witha € &,b € & are studied. 
In particular, it is shown that the sum of digits of the products ab is well distributed, 
and the size of the subsets 7, @ € {1...,N} with the property that ab+ 1 is always 
squarefree is estimated. 


Keywords Product of sequences * Shifted products * Digit properties 
¢ Squares * Multiplicative characters * Large sieve 
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1 Introduction 


Throughout this paper we will use the following notations: N and R denote the 
set of positive integers, resp. real numbers. w(n) denotes the number of distinct 
prime factors of n, y(n) is Euler’s function, and (x) is the number of primes 
not exceeding x. The notation S means that y runs over the (multiplicative) 
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characters modulo m, while pe denotes that y runs over the primitive characters 
x mod m 
modulo m. ||... || denotes the distance to the nearest integer and e(t) = exp(2izt). 

In [1] we presented a possibly complete list of the papers written on arithmetic 
properties of sumsets, and we studied several further problems of this type. This 
paper is the multiplicative analogue of Balog et al. [1]: our goal is to present the 
possibly complete list of the papers written on arithmetic properties of products 
and shifted products. Note that when we are looking for the multiplicative analogue 
of a problem on a certain arithmetic property of sums a + b with a, b taken from 
“large” sets of, # of integers, then the most natural way is to replace a + b in the 
given problem by the product ab. However, if the arithmetic property studied is of 
multiplicative nature, then the problem obtained in this way is usually trivial; in such 
a case, one may help by taking shifted products ab + | instead of the products ab. 
For example, Erdés et al. [11] proved that if <7, # are large subsets of {1,...,N} 
then the sums a + b witha € &, b € Z& satisfy an Erdés—Kac type theorem, 
ie., the distribution of (w(a + b) — log log N) /(log log N)'/?can be approximated 
by the normal distribution (see also [8] and [38]). If we replace w(a + b) by w(ab), 
then we do not get an interesting problem, since because of the additivity of the w 
function we may separate a and b in w(ab): w(ab) = w(a) + w(b) if (a,b) = 1. On 
the other hand, considering w(ab + 1) instead of w(ab), again we get an interesting 
problem; indeed, Elliott and Sark6zy [9] showed that the shifted products ab + 1 
also satisfy an Erdés—Kac type theorem. 

The first problem on arithmetic properties of (shifted) products was studied by 
Diophantus more than 2000 years ago. He asked the following question: how many 
positive integers a; < dj) < +++ < a, can be given so that aja; + 1 is a square for 
all 1 < i <j < k? Fermat and Euler also studied this problem, and finally Dujella 
[7] settled this problem in 2004. In the last 50 years many further papers have been 
written on arithmetic properties of products and shifted products: [2—4, 9, 12, 15- 
17, 17-22, 27-37]. 

In this paper we will study two further problems on arithmetic properties of 
products and shifted products. First in Sects.2 and 3 we will show that the sum 
of digits of the products ab witha € &,b € & (where & and Fare large subsets of 
{1...,N}) is well distributed, and then in Sects. 4 and 5 we will study the following 
question: how large can be the subsets <7, # of {1,...,N} with the property that 
ab + 1 is squarefree for every a € @,b € #? 


2 The Sum of Digits of Products ab 


Let g = 2 be an integer. Every integer n > 0 can be written uniquely in base gq: 


n= So ng! 


k20 
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where the digits n, € {0,...,q — 1}. The sum of digits function s,(m) is then 
defined by 


Sq(n) = » Nk. 


k20 


The sum of digits function appears in many mathematical problems (see [23] for 
a survey of these questions). Gelfond [14] proved that the sum of digits function 
is well distributed over arithmetic progressions and conjectured that a similar 
result holds true over the prime numbers and over polynomial sequences. Recently 
Mauduit and Rivat proved Gelfond’s conjecture for any basis gq > 2 both for the 
prime numbers [25] and for the squares [24], while Drmota, Mauduit and Rivat [6] 
managed to prove Gelfond’s conjecture for polynomials for a sufficiently large basis 
q (depending on the degree of the polynomial). 

In this paper our aim is to prove the analogue of these results for products ab 
withhae & Cf{l,...,N}andbe AC {1,..., N}, namely 


Theorem 2.1. For any integer q = 2 there exists an explicit constant cg > 0 
[see (4)] such that for any subsets &, B of {\,...,N} we have 


Y Lele s,(ab))| <q NCP) Jog nv (1) 
aca beB 
uniformly for a € R, where || - || denotes the distance to the nearest integer and 


e(t) = exp(2izt). 


(It follows from this theorem that if m grows “slowly” in terms of q and N, then the 
numbers s,(ab) are well distributed modulo m.) 


3 Proof of Theorem 2.1 


It follows from Proposition 3.2 of [5] (which is a variant of Proposition | of [25]) 
that there exists 6 > 0 such that, uniformly for all complex numbers a,, and b, with 
lam| < 1 and |b,| < 1, and all a € R we have 


DE am bn eCarsg(mn))] Kq (we + v) Geel VA), (2) 


gt! <m<qt q’—|<n<q" 
whenever ju and v are positive integers satisfying 


1 2 
aie 2 ag. (3) 
3 a) 
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where 
1. mw q—1 
Cq i= => min | 4a@,, —————_— } . (4) 
28 12 (¢+ llogg 


with 


@=1- 


log(2 + V2). w= (3 ae a 
q ae ey 


2log2 2 log3 log q 


If &, Z& are subsets of {1,...,N}, we can define 
a’ = AN(N-F,N), B=BOIN FI. 


Then 


Y= Yo ews, (ab)) = > Y> e(as,(ab)) + O(n). 


aca beB ac! be B’ 
Observing that ||(q — 1)a||? < 4 we deduce that in (1) we may replace &/ by /’ 


and BZ by &' at the price of a negligible error term. Therefore from now on we may 
assume that 


A C(N-F MN, Bc in -tM. (5) 
For positive integers jz and v we define 


B= Ag "4+1,¢"], B=BN ig? +1,q’]. 


We have 
lool V1 
d=|)m%, =U 
L=L0 v= V0 

with 

Cq\ log N log N 

Ho =v =1+ (1-2) M1 =|~= 

4/ logg log q 

Hence for N large enough 
1 Mo - BK | fh 2 


< < < < 
3 Mit vy a+y [Lo + Vo 3 
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which implies that condition (3) is satisfied (actually for any 6 > 0). It follows that 
we can apply (2) with 


a lifme &, | p, - J lifne B, 
™~) 0 otherwise. ’ ” _) O otherwise. 
We obtain 
—~e,\(g—Dall2 
YD dS e@s,(ab))| Kq (u + v) Qu treme Del), (6) 


deh, bEB, 


We have for all 6 € (0, 1) 


HI ML 
3S gi? < gitHe Kq N°, 3 gq’? < git Kq No, 
B= Ho v=V9 
and w+v < w+; < logN. Using these estimates with 6 = 1 —c,||(q— 1)e||? 
together with (6), we can sum non-trivially over jz and v and obtain 


V1 


Hl 
<> YY DY eas, (ad)) 


H=Lo V=V0 |a€ xh, bDEBy 


b> > e(a s,(ab)) 


aca bEB 


&; N20-eallQ—Dal) og N, 


which is (1). 


4 Squarefree Shifted Products 


Erdés and Sark6zy [10] studied the following problem: how large can a subset </ C 
{1,...,N} be with the property that a + a’ is squarefree for alla € &@,a’ EW? 
They proved the following results: 


Theorem A For N > No there exists a subset &@ C {1,...,N} such that 


1 
@A| > ——logN 
Hl aa OE 


and a+’ is squarefree for alla € &, a! € &. 


Theorem B [fN > Ni, & C {1,...,N} and a+’ is squarefree for alla € &, 
a’ € &, then we have 


|.o/| < 3N*/* log N. (7) 
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Next we will study the multiplicative analogue of this problem in the more 
general form that we take two subsets &, A C {1,...,N} such that the shifted 
products ab + 1| are squarefree for alla € «&, b € &, and the question is that how 
large is M(N) = max.g || |Alfor subsets <7, A with these properties? We will 
prove the following theorem in Sect. 5: 


Theorem 4.2. IfN > No, &,B C {1,...,N} and ab + 1 is squarefree for all 
ace @,beE &, then we have 


min(|.e/| ,|B|) < 2N7/4 log N. (8) 

Note that for <f = & this gives 

|.o| < 2N7/* log N. 
(Compare this with (7) in the additive case.) 

It is stated in [10] (without proof) that for N — oo there exist sets 7, B C 
{1,...,N} such that |.o/||A|/N — oo and the sumsa+bwithae SG be B 
are all squarefree. Indeed, with some work one can construct such sets <7, 4 with 
|| >> N, |B| > logN. In the case of shifted products we have not been able to 
prove such a result. If we take YW = {n: 1 <n < N, n+ 1issquarefree and 


ZB = {i}, then we have |.7||A| = (+ o(1))SN and ab + 1 is squarefree for all 
ace fh, be &, thus trivially we have 


6 
d+ o(1))—N < M(N). 
TU 
On the other hand, by Theorem 4.2 we have 
MIN) < 2N7*logn. 


There is a large gap between these lower and upper bounds. We think that the trivial 
lower bound is closer to the value of M(N) and, perhaps, there are c,,c2 > 0 with 


N(log N)*! < M(N) < N(log N)*. 


5 Proof of Theorem 4.2 


Theorem B was proved by using the large sieve with prime square moduli. Instead, 
here we will use Gallagher’s (multiplicative) character version of the large sieve 
[13]. Using the large sieve with optimal constants [26], Gallagher’s sieve becomes 


On Arithmetic Properties of Products and Shifted Products 351 


Lemma 5.1. Fort > 1, M € Z, N € N and any complex numbers cy+1, Cu+2)---> 
Cu+n We have 


2 


k 7 M+N M+N 
ie yy | dS ax@] <W-1+?) Yo lal’. 
kt " x mod k |n=M+1 n=M+1 


Now assume that contrary to the statement of Theorem 4.2 we have 
min(|.o/| ,|B|) > 2N*/4 logN, (9) 


however, ab + 1 is squarefree for alla € &, bE Z. 
Then for every prime p we have 


0= [{(a,b): aE &, bE &, ab + 1 = 0 mod p’}| 


= rea ye 2-DY xla), 


x mod p aEa bEB 
so that 
Xo(-1) Y2 YS xolab) =- YS X-1D) YY x@d), 
aed bEB x mod p? aed bEB 
XFXo 
1.e. 


I{(a,b), ae A, be B, (ab,p)=l=- Y X-D) DV x@ Dox), 


x mod p? acd beB 
XFXO 


whence 


I{(a,b), ae A, bE Z, (ab,p)=YIl< Do 


x mod p* 
XF Xo 


> x(a) 


acd 


we) 


beB 


(for every p). Write Q = N'/4, Then consider this inequality for every p with Q/2 < 
p < Q, and add these inequalities. Denoting the sum on the left-hand side by S, 
we get 


S= SY |ka,b), ae a, bE Z, (ab,p) = 1} (10) 
Q/2<p<Q 


a> 


Q/2<pS<Q y mod p? 
XF Xo 


Yo x@ 


aca 


Ye x) 


bEeB 
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By (9) and the prime number theorem we have 
S= > K@b), ae, be &, (ab,p) = 1}| (11) 
Q/2<p<@Q 


> > Kab), ae a, be B}| 
Q/2<p<Q 


—l{(a,b), aE GA, bE, p | b}|-l{(a,b), a€ LH, bE Z, p | a}| 


> |e/\|Alo(Q) - x(0/2)- YY (11 + 11%) 


Q/2<p<Q 


2 |#||BA| (x(Q) — 1(Q/2)) — (| + |A)) a - m(Q/2)) 


= (|| |B| — 2 (|| + |Al) N*) (x(Q) — x(Q/2)) 


2 
og O/2 


On the other hand, it follows from (10) by the Cauchy—Schwarz inequality that 


Ni/4 
= (1+ o(1)) M41 75 He OO EN El ay 


1/2 1/2 


se) +> SO yD rol 


Q/2<pSQ y mod p? 14EL Q/2<pSQ y mod p? |LEB 
XFX0 XFX0 


2 
(12) 


< si2gs¥? 
where 
2 


=r Y 


PQ x mod p? 
XF Xo 


Yo x@ 


aed 


and Sg is defined similarly with 4 in place of <. A non-principal character y 
modulo p? is not primitive character modulo p? if and only if it is a primitive 
character modulo p. Thus we may rewrite and estimate the sum S,, in the following 
way: 

2 2 
21a) + DU 


Sa =>- > x(a) 


PSQ \ mod p? lac# x mod p |aEee 
2. 

Day |r xe 

k<Q? - X mod k |aE 
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This sum can be estimated by using Lemma 5.1 (with c, = lifn € & andc, = 0 
ifn ¢ &). Then we get that 


Sg <(N-1+Q*) }°1<2N/Al, (13) 
aca 
and in the same way we get 
Sg@<2N|B\. (14) 


It follows from (11), (12), (13) and (14) that 


Ni/4 
(2 + o(1)) |A||A| <S<2N(A| |B)”, 
log N 


whence 
(JA| |B)? < (1 + 0(1))N* log, 


which for N large enough contradicts our indirect assumption (9), and this completes 
the proof of Theorem 4.2. 
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Narrow Progressions in the Primes 


Terence Tao and Tamar Ziegler 


To Helmut Maier on his 60th birthday 


Abstract In a previous paper of the authors, we showed that for any polynomials 
P,,..., Px € Z[m] with P,\(0) = ... = P,(0) and any subset A of the primes in 
[N] = {1,...,N} of relative density at least 6 > 0, one can find a “polynomial 
progression” a+ P;(r),...,a+ P,(r) in A with 0 < |r| < N°, if N is sufficiently 
large depending on k, P;,..., Px and 6. In this paper we shorten the size of this 
progression to 0 < |r| < log” N, where L depends on k, P},..., Px and 68. In the 
linear case P; = (i— 1)m, we can take L independent of 6. The main new ingredient 
is the use of the densification method of Conlon, Fox, and Zhao to avoid having to 
directly correlate the enveloping sieve with dual functions of unbounded functions. 


1 Introduction 


1.1 Previous Results 


We begin by recalling the well-known theorem of Szemerédi [18] on arithmetic 
progressions, which we phrase as follows: 


Theorem 1.1 (Szemerédi’s Theorem). Let k > 1 and é > 0, and suppose that N 
is sufficiently large depending on k,6. Then any subset A of [N] := {né€ Z:1< 
n < N} with cardinality |A| > 6N will contain at least one arithmetic progression 
aatr,...,a+ (k— 1)r of length k, with r > 0. 
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In fact, by partitioning [N] into intervals of some sufficiently large but constant 
size L = L(k,d) and using the pigeonhole principle, one can ensure that the 
progression described above is “narrow” in the sense that r < L(k, 6). 

In [10], Szemerédi’s theorem was relativized to the primes P = {2,3,5,7,...}: 


Theorem 1.2 (Szemerédi’s Theorem in the Primes). Let k > 2 and 6 > 0, and 
suppose that N is sufficiently large depending on k, 5. Then any subset A of [N] 1 P 
with |A| => 6|[N]NP| will contain at least one arithmetic progressiona,a+r,...,a+ 
(k — 1)r of length k, with r > 0. 


In particular, the primes contain arbitrarily long arithmetic progressions. 

The proof of Theorem 1.2 does not place a bound on r beyond the trivial bound 
r < N.Incontrast with Theorem 1.1, one cannot hope here to make the step size r of 
the progression as short as L(k, 5). Indeed, as observed in [1], if N is large enough, 
then from [8, Theorem 3] one can find a subset A of [N] OP with |A| > 3|[N] N P| 
(say) such that the gap between any two consecutive elements of A is > clog N for 
some absolute constant c > 0, which of course implies in this case that r must be 
at least clog N as well. Indeed, one can improve this lower bound to log’! N by a 
small modification of the argument: 


Proposition 1.1. Let k > 2, let ¢ > 0 be sufficiently small depending on k, and 
suppose that N is sufficiently large depending on k. Then there exists a subset A of 
[N] NP with |A| >> |[N]NP| such that A does not contain any arithmetic progression 
aat+r,...,a+(k—1)roflengthk with0O<r< elog! N. 


Proof. For any 0 < r < elog*'N, let N, denote the number of a € [N] such that 
a,a+r,...,a+ (k— 1)r are all prime. By the union bound and the prime number 
theorem, it will suffice to show that 


1 
pa we a log N 


0<r<e log’! N 


(say), since one can then form A by removing all the elements a in [N] N P that are 
associated with one of the N,. But from standard sieve theoretic bounds (see, e.g., 
[12, Theorem 5.7]) we have 


N 
N, < C.G(k, 1), — 
log" N 


where C; depends only on k and 6(k, r) is the singular series! 


k 
G(k,r) =TI(1-=) (1-2) (1) 


7 P 


‘Al sums and products over p in this paper are understood to be ranging over primes. 
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and v,(r) is the number of residue classes a € Z/pZ such that at least one of a,a + 
r,...,a+ (k— 1)r is equal to 0 mod p, so it suffices to show that 


~ S(k,r) < C.M 


0<r<M 


for any M > 1 (we allow C; to represent a different k-dependent constant from 
line to line, or even within the same line). One could obtain precise asymptotics on 
the left-hand side using the calculations of Gallagher [7], but we can obtain a crude 
upper bound that suffices as follows. From (1) we see that 


1 
G(k,r) < Cyexp Gy, — 


pir 


and hence by [19, Lemma E.1] 


log 
Gk) < Co 


pir 


and thus 


log pM 
Blin) SC) Ts OM 


O<r<M P 


as required. 


In the converse direction, if we use the “Cramér random model” of approximating 
the primes P M [N] by a random subset of [N] of density 1/logN, we can 
asymptotically almost surely match this lower bound, thanks to the work of Conlon- 
Gowers [5] and Schacht [17]: 


Proposition 1.2. Let k > 2 and 6,¢ > 0, let C > 0 be sufficiently large depending 
on 6,k, and suppose that N is sufficiently large depending on k, 6, €. Let P C [N] be 
chosen randomly, such that each n € [N] lies in P with an independent probability 
of 1/ log N. Then with probability at least | — ¢, every subset A of P with |A| => 6|P| 
will contain an arithmetic progression a,a+1r,...,a + (k — 1)r of length k with 
0<r<Clog'N. 


We remark that a modification of the argument in Proposition 1.1 shows that we 
cannot replace the large constant C here by an arbitrarily small constant c > 0. 


Proof. We partition [N] into intervals [,,..., Jn of length between £ log’! N and 


Clog*! N, thus m < ay: For each interval J;, we see from [17, Theorem 2.2] 
‘ be 


or [5, Theorem 1.12] that with probability at least 1 — 55, every subset A; of PM Jj 
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with |A;| > S\P 7 J;| will contain an arithmetic progression of length at least k. Call 
an interval J; bad if this property does not hold, thus each J; is bad with probability 
at most 6e/10. By linearity of expectation followed by Markov’s inequality, we 


conclude that with probability at least 1 — ¢, at most SETH of the J; are bad. 


Then if A C P is such that |A] > 6|P|, then at most SIP of the elements of A 
are contained in bad intervals, so from the pigeonhole principle there exists a good 
interval J; such that |A N J;| > S\P M I;|, and the claim follows. 


It is thus natural to conjecture that in Theorem 1.2 one can take r to be as small 
as log! +0 ny, 


Remark 1.1. If one seeks progressions inside the full set P of primes, rather than 
of dense subsets of the primes, then the Hardy—Littlewood prime tuples conjecture 
[13] predicts that one can take r to be of size O;,(1); indeed, one should be able 
to take r to be the product of all the primes less than or equal to k. In the case 
k = 2, the claim that one can take r = O(1) amounts to finding infinitely many 
bounded gaps between primes, a claim that was only recently established by Zhang 
[20]. For higher k, the claim r = O,(1) appears to currently be out of reach of 
known methods; the best known result in this direction, due to Maynard [15] (and 
also independently in unpublished work of the first author), shows that for any 
sufficiently large R > 1, there exist infinitely many intervals of natural numbers 
of length R that contain > clog R primes for some absolute constant c > 0, but this 
is too sparse a set of primes to expect to find length & progressions for any k > 3. 


We now consider generalizations of the above results, in which arithmetic 
progressions a,a+r,...,a+ (k— 1)r are replaced by “polynomial progressions” 
a+ Pi(r),...,a+ Px(r). More precisely, let Z[m] denote the ring of polynomials 
of one indeterminate variable m with integer coefficients. Then Bergelson and 
Leibman [3] established the following polynomial version of Theorem 1.1: 


Theorem 1.3 (Polynomial Szemerédi’s Theorem). Let k > 1, let P\,...,Px € 
Z{m] be such that P\(0) = ... = Px(0), let 6 > 0, and suppose that N is sufficiently 
large depending onk, P\,..., Px, 6. Then any subset A of |N] with cardinality |A| = 
6N will contain at least one polynomial progression a + P\(r),a + P2(r),...,a+ 
P,(r) with r > 0. 


Of course, Theorem 1.1 is the special case of Theorem 1.3 when P; = (i — 
1)m. As with Theorem 1.1, a partitioning argument shows that we may take r < 
L(k, Pi,..., Px, 6) for some quantity L depending on the indicated parameters. The 
polynomial analogue of Theorem 1.2 was established by the authors in [19]: 


Theorem 1.4 (Polynomial Szemerédi’s Theorem in the Primes). Let k > 2, let 
Pi,...,Px € Z[m] be such that P\(0) = ... = P;(0), €,6 > 0, and suppose 
that N is sufficiently large depending on k, P\,...,Px,6,€. Then any subset A of 
[IN] N P with |A| > 6|[N] Q P| will contain at least one polynomial progression 
at+P\(r),a+ Po(r),...,a+ P(r) with0<r< N°. 
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In particular, this implies Theorem 1.2 with a bound 0 < r < N°), 


Remark 1.2. The condition P;(0) = ... = P,(0) in Theorem 1.3 can be relaxed to 
the property of intersectivity (that P;,...,P, have a common root in the profinite 
integers Z= limgoe-m Z/mZ); see [4]. It is not yet known if Theorem 1.4 can 
similarly be relaxed to intersective polynomials, except in the k = 2 case which was 
established in [14]. We will not pursue this matter here. 


1.2. Main New Result 


Our main result is to improve the bound on the r parameter in Theorem 1.4 to be 
polylogarithmic in size: 


Theorem 1.5 (Short Polynomial Progressions in the Primes). Let k > 2, let 
P,,...,Px € Z[m] be such that P\(0) = ... = P,(0), €,6 > 0, and suppose 
that N is sufficiently large depending on k, P\,..., Px, 6,€. Then any subset A of 
[N] N P with |A| > 6|[N] Q P| will contain at least one polynomial progression 
a+P,(r),a+ Po(r),...,a+ Py(r) with 0 < r < log” N, where L depends only on 
k, Py,..., Px, 6. 


In particular, there are infinitely polynomial progressions a + Pi(r),a + 
P2(r),...,a + P;(r) consisting entirely of primes with 0 < r < log’ a, with 
L now depending only on k, P;,..., Px. This is new even in the case of arithmetic 
progressions a,a+r,...,a+ (k—1)r. 

A modification of the proof of Proposition 1.1 shows that some power of log N is 
needed in the upper bound on r in Theorem 1.5. However, we do not know what the 
optimal value of L is; our argument for general P;,..., P, uses the PET induction 
method [2], and as such L will grow rapidly with the degrees of the P},..., Px. The 
dependence of L on 6 occurs for technical reasons, and we conjecture that one can 
in fact select L to be independent of 5; we can verify this conjecture in the arithmetic 
progression case P; = (i—1)m, and in fact we can take the explicit value L := Ck2* 
in this case for some fixed constant C (actually C = 3 would already suffice). We 
discuss this explicit variant of Theorem 1.5 in Sect. 7. Propositions 1.1, 1.2 suggest 
that we should in fact be able to set L = k — 1 + ¢ in these cases, although our 
methods do not seem strong enough to achieve this, even in the k = 3 case. 

It is possible that one might be able to directly modify the arguments in [19] 
(which are in turn based on those in [10]) to establish Theorem 1.5; the main 
technical difficulties in doing so are the need of having to correlate the enveloping 
sieve with dual functions of unbounded functions and verifying the “correlation 
condition” required in that argument, in the setting when r is as small as log” N (and 
the argument appears to have no chance of working when L is independent of 5). On 
the other hand, the need for the analogous correlation and boundedness conditions 
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in [10] to prove Theorem 1.2 were recently removed’ by Conlon et al. [6], using 
a new method which they refer to as “densification”. We will be able to combine 
the densification method with the arguments in [19] to establish Theorem 1.5. As a 
consequence, we also obtain a slightly different proof of Theorem 1.4 than the one 
in [19], in which the (rather complicated) verification of the correlation condition is 
no longer necessary, but the densification arguments of Conlon, Fox, and Zhao are 
inserted instead. 


2 Preliminary Reductions 


We now begin the proof of Theorem 1.5. We use the following asymptotic notation. 
We let N’ be an asymptotic parameter tending to infinity along some sequence 
N = N; of natural numbers. All mathematical objects in this paper are implicitly 
permitted to depend on N’, unless they are explicitly designated to be fixed, in which 
case they are independent of N’. We use X = O(Y), X < Y, or Y > X to denote 
the estimate |X| < CY for some fixed C, and X = o(Y) to denote the estimate 
|X| < c(N’)Y where c(N’) — 0 as N’ — oo. Our statements will be implicitly 
restricted to the regime in which N’ is sufficiently large depending on all fixed 
parameters. 

Suppose for the sake of contradiction that Theorem 1.5 failed. Carefully negating 
the quantifiers (and relabeling N as N’, for reasons that will be clearer later), we 
conclude that we may find a fixed k > 2 and fixed polynomials P;,...,P, € Z[m] 
with P\(0) = ... = P,(0), fixed 6 > 0, a sequence N’ = Ni of natural numbers 
going to infinity, and a set A = Ay C [N’,2N’] MN P such that 


|A| = 5|[N’] 9 P| 


and such that for any fixed L > 0, there are no polynomial progressions a + 
Pi(r),a+ P(r),...,a+P,(r) in A with 0 < r < log” N’ (recall that we assume N’ 
sufficiently large depending on fixed quantities such as L). 

The first few reductions are essentially the same to those in [19]. We begin with 
the “W-trick” from [10] to eliminate local irregularities modulo small primes. We let 
w = wy be a sufficiently slowly growing function of N’; for instance, we could take 


The “dual function condition” that the dual function of the enveloping sieve is bounded already 
failed in the arguments in [19], which was a significant cause of the complexity of that paper due 
to the need to find substitutes for this condition (in particular, the correlation condition became 
significantly more difficult to even state, let alone prove). But the arguments in [6] do not require 
any version of the dual function condition at all, leading to some simplifications in the current 
argument over those in [19]. 
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wis a log log log N’ as in [19] for the sake of concreteness, although the precise 
value of w is unimportant. We then define the quantity 


and the natural number 


From the prime number theorem? we have 


NW 
A ee ees 
ae log N 


and all elements of A larger than \/N (say) are coprime to W. Thus, by the 
pigeonhole principle, we may find b € [W] coprime to W (and depending on N’ 
of course) such that 


l{n € [N]:nW+beEA}| > @(W) logN’ 


where @ denotes the Euler totient function. 

The domain [N] is not quite translation invariant. In order to eliminate this 
(minor) difficulty, we will follow [19] and work instead in the cyclic group X := 
Z/NZ using the obvious embedding 1 : [V] — X. We give this space the uniform 
Haar probability measure, thus 


[f=3 D0 


xEX 


for any f : X — R. We also define shift maps T’f : X — R for any h € Z by 
Tf) :=f@ +h); 
clearly we have the identities 
Cre Tey 
T" (fg) = (T'f)(T"g) 
[re =the 


for any h,k € Zand f,g : X — R. We will use these identities frequently without 
further comment in the sequel. 


3 Actually, the weaker lower bound z(x) >> —~ of Chebyshev would suffice here. 


logx 


364 T. Tao and T. Ziegler 


We will need a fixed quantity €) > 0 (depending only on k, P;,..., Px) to be 
chosen later. We define the function f : X — R by the formula 


_ € &(W) logN 
fun) = FO 


Liv—-va\tvn|() La(nW + 5) (2) 


for n € [N], where 1,4 denotes the indicator function of A; the reason for the 
normalizing factor 7f is so that f can be dominated by an enveloping sieve v, to 
be defined later. We then have 


[r> i: (3) 
xX 


where we allow the implied constants here to depend on the fixed quantities 6 
and €. 
Now let L > O be a sufficiently large fixed quantity (depending on 


k, P,,...,P,, 5) to be chosen later. We will need the “coarse scale” 
M := log’ N (4) 
which will basically be the domain of interest for the polynomials P),..., Px, and 


the “fine scale” 
AH:= log¥= N, 


which will basically be the scale used for various applications of the van der Corput 
inequality. Note in particular that any quantity of size O(H°) /M) will also be 
o(M) if the implied constants do not depend on L, and L is large enough. 

By hypothesis, if N (or N’) is large enough, then the set A contains no polynomial 
progressions of the form a+ P\(Wm),...,a+ P.(Wm) with m € [M]. In particular, 
we see that 


A(f,....f) =0 (5) 
where A is the k-linear form 
A(fi. xo fe) ‘= ome (M] i} TP Wn) /W ke TP Wm) /W q (6) 
x 
for k functions fi,...,f; : X — R, and we use the averaging notation 


1 
Eneaf (a) := Al Y F(a). 


acA 
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Note that there are no “wraparound” issues caused by the embedding into Z/NZ, 
due to our removal in (2) of the elements n € [N] that are less than JN or larger 
than N— JN. 

On the other hand, using the (multi-dimensional) polynomial Szemerédi theorem 
of Bergelson and Leibman [3], we have the following quantitative version of 
Theorem 1.3: 


Theorem 2.6 (Quantitative Polynomial Szemerédi Theorem). Let 56 > 0 be 
fixed, and let g : X — R obey the pointwise bounds 
O<g<l 


(i.e., 0 < g(x) < 1 forall x € X), as well as the integral lower bound J, g => 6—o(1). 
Then we have 


A(g,...,8) = c(8) — o(1) 


where c(6) > 0 depends only on 6, k, and P,,..., Px. 


Proof. See (19, Theorem 3.2]. In that theorem, the common value P)|(0) =... = 
P,(0) of the P; was assumed to be zero, but the general case follows from this case 
by a simple change of variables. 


This theorem cannot be directly applied to control A(f,...,f), because f is 
not uniformly bounded. However, Theorem 1.5 will now be a consequence of the 
following claim. 


Theorem 2.7 (Approximation by Bounded Function). Suppose that €) > 0 is 
sufficiently small (depending on k, P,,..., Px). Let ¢ > 0 be fixed. Suppose that L is 
a fixed quantity which is sufficiently large depending on k, P\,..., Px, €, €p. Let f be 
as in (2). Then there exists g : X — R with the pointwise bounds 


Vee=1 
such that 
[1- [5] «ero (7) 
x x 
and 
|A(f,...,f) — A(g,....9)| e+ o(1), (8) 


where the implied constants in the < notation do not depend on ¢ or L. 


Let us assume Theorem 2.7 for now, and see how it implies Theorem 1.5. Let 
€9 > 0 be small enough for Theorem 2.7 to apply, ¢ > 0 be a sufficiently small 
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fixed quantity (depending on 6, € 9) to be chosen later, and let L be large enough 
depending on k, P1,..., Px, €, €o (in particular, L will depend on 6). Let g be as in 
Theorem 2.7, let M be defined by (4), and A defined by (6) (in particular, A depends 
on L). From (3), (7), and the triangle inequality we have 


[er 
Xx 


if ¢ is small enough depending on €o, 5. Applying Theorem 2.6, we have 
A(g,...,g) > co—o(1) 


for some co > 0 depending on €o, 6 but not one, L. By (8) and the triangle inequality, 
we thus have 


A(f,...,f) > 0 


if e is small enough (depending on €o, 6) and N is large enough, contradicting (5); 
and Theorem 1.5 follows. 

It remains to establish Theorem 2.7. This is the focus of the remaining sections 
of the paper. 


Remark 2.3. The above arguments show that if one could remove the dependence 
of L on ¢ in Theorem 2.7, then one could also remove the dependence of L on 6 in 
Theorem 1.5. 


3 The Enveloping Sieve 
As in [10, 19], we view f as a “positive density fraction” of a well-controlled 
enveloping sieve v, which is defined by the explicit formula 


2 


] logd 
v(ein = SOBER | wax (PES) 
d|Wn+b 


for n € [N], where 
R= N*, (9) 


jz is the Mobius function, and vy : R > Risa fixed smooth even function supported 
on [—1, 1] with the normalization 


1 
/ IW@P dt = 1 
0 
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(where y’ is the derivative of y) and such that 7(0) > 1/2 (say). By construction 
and (2), we have the pointwise bound 


O<f<», (10) 
that is to say that O < f(n) < v(n) for alln € X. 
From [19, Corollary 10.5] we have the crude bound 


J 


[T]tv=1+0+0 Exp{O| )>- > 7 (11) 
xX 


j=l ISji<i'SJ wepsReR :p|hy—hj 


for any fixed J and any integers h,,...,hy = O(/N) (not necessarily distinct), 
assuming that € is sufficiently small depending on J, and where Exp(x) := e* — 1, 
where the implied constant in the O() exponent only depends on J. Here, of course, 
we use p|n to denote the assertion that p divides n. In particular, we have the mean 
bound 


ic 54 dati (12) 
x 
and the crude bound 


/ Ty ...TY vy & 10g°Y N (13) 
x 


for any fixed J and any integers 4,...,hy = O(N) (not necessarily distinct), 
assuming €o is sufficiently small depending on J, and where the implied constant in 
the O(1) exponent depends only on J. 

One can use (11) to establish the following fundamental pseudorandomness 
property: 


Proposition 3.3 (Polynomial Forms Condition). Let J,d,D > 1 be fixed natural 
numbers. Suppose that €9 is sufficiently small depending on J,d,D, and that L is 
sufficiently large depending on J,d,D. Then for any polynomials Q\,...,Q7 € 
Z{m,,...,mq] of degree at most D, with coefficients of size O(W?), and with 
Q; — Q; non-constant for every 1 < j < j’ < d, and any convex body 2 € 
[—M?, M?|¢ of inradius* at least H\/?, we have the asymptotic 


J 
mnconct | [ [ 7%» = 1+ 00). (14) 


j=l 


“The inradius of a convex body is the radius of the largest open ball one can inscribe inside the 
body. 
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This proposition was established in [19, Theorem 3.18] in the case when M is 
a small power of N; the point is that M can be lowered to essentially log” N for 
some large L. However, note that the number J of polynomials involved cannot be 
arbitrarily large depending on €9. The main obstruction to reducing the size of the 
coarse scale M is that we need the “diagonal” contributions to (14) (such as those 
coming from the terms where one of the Q;(h) vanishes) to be negligible when 
compared to the remaining terms. The sieve v (or powers thereof, such as v7) tends 
to have size log?" N on the average, and using this one expects to control diagonal 
contributions to (14) by something like log?) N/M, which will be negligible when 
M is a sufficiently large power of log N. 


Proof. We repeat the arguments from [19, §11]. For each h, we see from (11) that 


J 
[T] 72% =1 +00) +0 Exp | O a ye : 
x 


j=l 1<j<j’<J wSp<R'°28:p|Q;(h)—Oy (h) 


Thus it suffices to show that 


Sneenzd Exp | O y a BY ie o(1). 


I<i<j"<J w<p<R2®:p|Q;(h)—Q; (h) 


Using the elementary bound Exp(a+b) « Exp(2a)+Exp(2b) repeatedly, it suffices 
to show that 


1 
ne anzé Exp | O > —] | =o(). 
w<pRER:p]O;(h)—Oy (h) 


foreachl <j<j<d. 

We first dispose of the “globally bad” primes, in which p divides the entire 
polynomial Q; — Q;. As Q; — Qj; is non-constant and has coefficients O(we), 
we see that the product of all such primes is OCW), In [19, Lemma E.3], it is 
shown that 


y= =o) 


p=wip|n 


for any n = O(W), Thus the contribution of such primes in the above sum is 
negligible. 
In [19, Lemma E. 1], it is shown that 


() 
Exp | O y; Pp aie 


peA pEA 
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for any set A of primes. Thus it suffices to show that 
OW) p 


> log 


w<p<Rlosk P 


Se enz |p|a;(h)—O, (h);p10,-07 = (1). 


From [19, Lemma D.3], we conclude that if p does not divide Q; — Q;, then the 
average 1p\0;(4)—Q, (h) on any cube in Z/ of sidelength 1 < K < pis O(z). By [19, 
Corollary C.2], [19, Lemma C.4] and the inradius hypothesis we then have 


J 1 TH? 
“ne QnZ4 1 p|aj—hy < ; + Z 


Thus we reduce to showing that 


but this follows easily from Mertens’ theorem (or the prime number theorem) and 
the definition of M and w, if L is large enough. 


4 Averaged Local Gowers Norms 


As in [19], we will control the left-hand side of (8) using some local Gowers norms, 
which we now define. Given any d > 2 and integers a),...,aq and any scale S > 1, 


1 
I] pen Part. tm aa ¢ ( >) 


(@},....@g)€{0,1}4 
for f : X — R. Next, for any t > 2 and any d-tuple Q = (Q),...,Qa) of 
polynomials Q; € Z[h,,...,h,, W] int + 1 variables, we define the averaged local 
Gowers uniformity norms ye ™ on functions f :X — R by the formula 


2d F = 2d 
LAI quan >= lhe[A]! IAI) 01 @.1)...0400.0) - (16) 
s s 


These are indeed norms; see [19, Appendix A]. One can extend these norms 
to complex-valued functions by inserting an alternating sequence of conjugation 
symbols in the product, but we will not need to use such an extension here. We 
remark that these expressions may also be defined for d = 1, but are merely 
seminorms instead of norms in that case. 
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From the Gowers—Cauchy—Schwarz inequality (see, e.g., [11, Appendix B]) and 
Holder’s inequality, we record the useful inequality 


) © a oa | Tm at my aa g 
“my are My NY goers my €[S] Y “ 


w€{0,1}4 (17) 
d 
< [] Wlljaa 
w€{0,1}4 
for any functions f,, : X > R for w € {0, 1} where we write m := (@,...,@,). 


In a similar spirit, we have the following basic inequality: 


Theorem 4.8 (Polynomial Generalized von Neumann Theorem). Suppose that 
€y > 0 is a fixed quantity which is sufficiently small depending on k, P,,..., Px, 
and that L is a fixed quantity which is sufficiently large depending on k, P\,..., Px. 
Then there exists fixed t > 0,d > 2 and a fixed d-tuple Q = (Q,...,Qq) of 
polynomials Q; € Zlhy,...,h;, W], none of which are identically zero, and which 
are independent of €9, L, such that one has the inequality 


|A(gi, oo 79) <« min gill ewan + o(1) 
le JM 
for some fixed c > 0 and all gi,...,g% : X — R obeying the pointwise bound 


lgi| <v+1foralli=1,...,d. 


Proof. This is essentially [19, Theorem 4.5] (which was proven by a combination 
of PET induction, the Cauchy—Schwarz inequality, and the polynomial forms 
condition), with the only difference being that H and M are now polylogarithmic 
in N, rather than polynomial in N. However, an inspection of the proof of [19, 
Theorem 4.5] shows that this does not affect the arguments (after replacing the 
polynomial forms condition used there with Proposition 3.3, of course); the key 
relationships between H, M,N that are used in the proof are that (HW)°) /M = 
o(M) and that (HWM)? = o(/N), where the implied constants depend only on 
k,P,,..., Px (and in particular are independent of €o, L), and these properties are 
certainly obeyed for the choice of H and M used here. (The bound (13) is sufficient 
to deal with all the error terms arising from use of the van der Corput inequality in 
this regime.) 


Setting go = ... = gx = 1, we obtain in particular that 
[1 < lel guna + 200 
x Vii 


(In fact, one can take c = | in this inequality by the standard monotonicity 
properties of the Gowers norms, see [19, Lemma A.3], but we will not need this 
improvement here.) 
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In view of this theorem, Theorem 2.7 is now a consequence of the following 
claim (after replacing ¢ with &°): 


Theorem 4.9 (Approximation by Bounded Function, Again). Let d > 2 and 
t > 0 be fixed, and let Q = (Q1,...,Qa) be a fixed d-tuple of polynomials 
QO; € Zlh,,...,h,, W], not identically zero. Let €¢9 > 0 be a fixed quantity that 
is sufficiently small depending on d,t,Q. Let ¢ > 0 be fixed, and let L be a fixed 
quantity that is sufficiently large depending on d, t, Q, €o, €. Let v be as above, and 
let f : X — R obey the pointwise bound 


O<f<v. 

Then there exists g : X — R with the pointwise bound 
Ve e2 1, 

such that 


If — sll ewan K € + off). (18) 
J 


It remains to establish Theorem 4.9. This is the objective of the remaining 
sections of the paper. 


Remark 4.4. As before, if one could remove the dependence of L on ¢ in Theo- 
rem 4.9, then one could remove the dependence of L on 6 in Theorem 1.5. Also, from 
this point on the number k of polynomials P;,..., Px, in Theorem 1.5 no longer plays 
a role, and we will use the symbol k to denote other (unrelated) natural numbers. 


5 The Dense Model Theorem 


Let d, t, Q be as in Theorem 4.9. The averaged local Gowers norm ||| owt) of a 
Jit 


function f : X — R can then be expressed by the identity 


Uleounn = f £9F (19) 
_ gQUHW),. 
where the dual function Df = D Nir f : X — R is defined by the formula 
Df = She [H]! E sie m®, m'?) es m €[/M] 
Il Toes m0) g (20) 


(@1,.-+5 wg)€{0,1}4\ {0}4 
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More generally, we define 


BDfoo eve {0,1}4\¢0}4 “= he [A]! im) aay m?) mn? les mS) €L Ml] 


Il Tota em 0108) (21) 


(@1,...,04) €{0,1}7\ {0}4 


for any tuple of functions f,, : X > R for @ € {0, 1}4\{0}4. 

Theorem 4.9 is then an immediate consequence of combining the following two 
theorems (with the function f appearing in Theorem 5.11 being replaced by the 
function f — g appearing in Theorem 5.10). 


Theorem 5.10 (Weak Approximation by Bounded Function). Let d > 2 and 
t > 0 be fixed, and let Q = (Qi,...,Qa) be a fixed d-tuple of polynomials 
QO; € Zlh,,...,h,, W], not identically zero. Let €¢9 > 0 be a fixed quantity that 
is sufficiently small depending on d,t,Q. Let ¢ > 0 be fixed, and let L be a fixed 
quantity that is sufficiently large depending on d, t, Q, €o, €. Let v be as above, and 
let f : X — R obey the pointwise bound 

O<f<v. 


Then there exists g : X — R with the pointwise bound 
Gees 1, 


such that 
[[u- QIA(Fa)weto,1}4\f0}4| K € + 0) (22) 
X 


for all functions F,, : X — R for w € {0,1}“\{0}4 with the pointwise bounds 
-1<F, <1. 

Theorem 5.11 (Densification). Let d > 2 and t > 0 be fixed, and let Q = 
(Q\,...,Q4) be a fixed d-tuple of polynomials Q; € Z[h,,...,h,, W], not identi- 
cally zero. Let €) > 0 be a fixed quantity that is sufficiently small depending on 
d,t,Q. Let ¢ > 0 be fixed, and let L be a fixed quantity that is sufficiently large 


depending on d, t,Q, €. Let v be as above, and let f : X — R obey the pointwise 
bound 


f|<v+1, 


and suppose that 


[£9 Fadoetonevor «Ket+o(l) (23) 
xX 
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for all functions F,, : X — R with the pointwise bounds —1 < Fy < 1. Then we 
have 


IIFIl eww “K e + o(1) 
Ji 


for some fixed c > 0 (independent of ©). 


We prove Theorem 5.10 in this section, and Theorem 5.11 in the next section. 

To prove Theorem 5.10, we invoke the dense model theorem, first established 
implicitly in [10] and then made more explicit in [9, 16, 19]. We use the formulation 
from [16, Theorem 1.1]: 


Theorem 5.12 (Dense Model Theorem). For every ¢ > 0, there is K = (1/e)? 
and ¢' > 0 such that, whenever F is a set of bounded functions from X to [—1, 1], 
and v : X — R®* obeys the bound 


| [w= DF..-Fet <eé 
x 


for all 0 < K’ < K and F\,..., Fx € F, and every function f : X > R with 
0 <f < v, one has a function g : X — [0,1] such that 


|[ ¢-eFlse 


forallF € F. 
This reduces Theorem 5.10 to the following calculation: 


Theorem 5.13 (Orthogonality to Dual Functions). Let d > 2 and t > 0 be fixed, 
and let Q = (Q),..., Qa) be a fixed d-tuple of polynomials Q; € Z[hy,...,h,, W], 
not identically zero. Let €) > O be a fixed quantity that is sufficiently small 
depending on d,t,Q. Let K > 0 be a fixed integer, and let L be a fixed quantity 
that is sufficiently large depending on d, t, Q, €9, K. Let v be as above. Then one has 


K 
[o- DT] 2G imoetonayo = 00) (24) 
k=1 


for all functions Fy : X > R fork = 1,...,K, @ € {0,1}“\{0}4 with the 
pointwise bounds —1 < Fy < 1. 


Remark 5.5. The fact that L depends on K here is the sole reason why L depends on 
6 in Theorem 1.5 (note that no parameter related to 6 or K appears in Theorem 5.11). 
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We now prove Theorem 5.13. Let d, t,Q,€9, K, L, v, Fx be as in that theorem. 
We expand out the left-hand side of (24) as the average of 


fo-ve im ) ELV MV i= 1d R= Lyon Kyo=0,1 


K 
d (aj) (0) 
I] I] Tia g —m; )Qi (Oe) Fy, 


(o1,...,@a)€{0,1}4\ {0}4 K=1 


(25) 


as h),..., hx ranges over [H]. 

We first deal with the degenerate cases in which Q;(hy) = 0 for some i,k. By 
the Schwarz—Zippel lemma (see, e.g., [19, Lemma D.3]), the number of tuples 
(hy,..., hx) with this degeneracy is O(H*''). Meanwhile, from (12) and the 
boundedness of the Fy», each expression (25) is O(1). Thus the total contribution 
of this case is O(H~'), which is acceptable. 

It thus suffices to show that the expression (25) is o(1) uniformly for all 
h,,..., hx with none of the Q;(h,) vanishing. 

The next step is to “clear denominators” (as in [19]). Fix hi,...,hx, and write 
D; := TI, |Oihy)| for i = 1,...,d. Then we have 1 < Dj < O(HW)?), and 
we can write 


D; = Oi(by)rix 


foreachi=1,...,d,k=1,...,K, and some r;, = O(HW)?), 


Let ee , ay ni) aan ae be elements of [M'/*]. Then if we shift each 


() 


variable m by rikn; °, We can rewrite (25) as 


[w-p Sn) LVM} —rign Wie] aig =1,...,.K;w=0,1 


(26) 
I] Tl TEE one m0) 2? =n), 
(@},...,@4) €{0, 1}4\ {0} 4 k= 1 


The shifted interval [/M JHrign® differs from [/M] by shifts by a set of cardinality 
O(M!/*(HW)?)), and so by (13) one can replace the former by the latter after 
accepting an additive error of O(log?) N)M!/4(HW)°) /,/M), where the implied 
constants in the exponents depend on K. It is at this point that we crucially use the 
hypothesis that L be large compared with K, to ensure that this error is still o(1). 
Thus (26) can be written as 


Je — DE “mm €[./MIVi=I.....d:k=,....Kjo=0,1 


(wj) (0) (a) ___ (0) 
de - hy) + (06? =n yD; 
Ter 10a) €£0.134\ {034 ne TE hi iy ES TE 


+o(1). 
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Averaging over all such i, we obtain 


E, © e.yMvi=l ire d3k=1,...,.K;@=0,1 Se 7 )) “n\”) nO) fd ay a e[u'/4] 


ey 


K fn am Joh) +n? —n 0; 
Tes a naeoain Me, Tia Mg IM; Oi(bi)+A(H —A;) oe 


+o(1). 


Shifting the integral Je by GD; and then using the Gowers— 
Cauchy—Schwarz inequality (17) (and the boundedness of the functions 


dO ; 
[il Tia mg —m; )Oi(be) Fr kw), We may bound this by 


But from expanding out the Gowers norm (15) and using Proposition 3.3 to estimate 
the resulting 22" terms (cf. [10, Lemma 5.2]), we see that 


v — 1174 =i aq 
I Way oba (1) (27) 


and Theorem 5.13 follows. 


6 Densification 


Now we prove Theorem 5.11. It will suffice to establish the following claim. 


Proposition 6.4. Let the notation and hypotheses be as in Theorem 5.11. Then one 
has 


(1) (wa) 
7 + m,  Qi(h)+...4+-m, “ Qa(h) 
ee ee 
we{0,1}4 


« ef + o(1) 


for some fixed c > 0 (independent of &), whenever (fu) e{o,1\4 iS a tuple of functions 
To 1X > R(with @ := (@1,...,@a)), such that one of the f,, is equal to f, and each 
of the remaining functions f,, in the tuple either obey the pointwise bound |fy| < 1 
or |fo| < v. 


Indeed, given the above proposition, then by triangle inequality and decomposi- 
tion we may replace the bounds |f,,| < 1 or |f.| < v with |f.| < v + 1, and then by 
setting f, =f for every w and using (19), we obtain the claim. 
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It remains to prove the proposition. We induct on the number of factors fi, for 
which one has the bound |f,,| < v instead of |f,,| < 1. First suppose that there are 
no such factors, thus |f,,| < 1 for all w except for one w, for which f, = f. By 
permuting the cube {0, 1}“, we may assume that it is Fxoya_ that is equal to f, with 
all other f,, bounded in magnitude by 1. But then the expression in (28) may be 
rewritten as 


[12 edwctor 
x 


and the claim follows from the hypothesis (23). 

Now suppose that at least one of the f,, (other than the one equal to f’) is bounded 
in magnitude by v rather than 1. By permuting the cube we may assume that |fyoya| < 
v. We then write the left-hand side of (28) as 


/ Foye BL 
x 


where f := (fu )wefo,1}4\{0}- By Cauchy—Schwarz, it thus suffices to show that 
[vor K e + 0(1) 
for some fixed c > 0. We will split this into two estimates, 
| [w= 1.6" = 00) 29) 
and 
[ (Pf) « & + ofl). (30) 


We set aside (29) for now and work on (30). Bounding all the components of f in 
magnitude by v + 1 and using Proposition 3.3, we see that 


/ (A)? «1, 
x 
so by Hdlder’s inequality, it suffices to show that 


/ |Sf| « & + o(1) 
Xx 
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(for a possibly different fixed c > 0). It thus suffices to show that 
|f eae <K & + o(1) 
x 


whenever g : X — R is such that |g| < 1. But this expression is of the form (28) 
with fio} replaced by g, and the claim then follows from the induction hypothesis. 

It thus remains to show (29). We can rewrite 

(at = gromenmy, 
where Q @ Q is the 2d-tuple 
Q0Q= (@Q.,...,Qa,Q1,..., Qa) 
and f2 = (f2.0)e{0,1}24\{0}4 is defined by setting 
fr,weto}4 = fo 

and 


fr,0}4@0 *= foo 
for w € {0, 1}¢\{0}4, and 


hr.w@o! :=1 


for w,w’ € {0, 1}4\{0!¢. Applying the Gowers—Cauchy—Schwarz inequality (17), 
we may thus bound the left-hand side of (29) by 


|v — 1] OPH.) I] Ike. |e@oun.w . 
vi w€{0.1}4\{0} vu 


Bounding f,, by v or 1 and using Proposition 3.3, we can bound 
[Leo Il _je@ownt.w) <1 
VM 
and further application of Proposition 3.3 (cf. (27)) gives 
lv — 1] ,e@ewa.m = (1) 
JM 


and the claim follows. 
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7 The Linear Case 


We now explain why in the linear case P; = (i — 1)m of Theorem 1.5, one may 


take L to be independent of 6. In the linear case, one can replace the averaged 


local Gowers norm U 7 


appears d = k—1 times; this follows by repeating the proof of [10, Proposition 5.3], 
after replacing some global averages with local ones. (In fact one could replace /M 
here by M!~° for any fixed o > 0.) As such, we can ignore the H parameter and the 
h averaging, and just prove Theorem 5.13 in the case when Q; = ... = Qy = 1. 
Here, the “clearing denominators” step is unnecessary, and so L does not need to be 
large depending on K, which by Remark 5.5 ensures that the final L is independent 
of 6. 


Remark 7.6. A more careful accounting of exponents (in particular, replacing (11) 
with a more precise asymptotic involving a singular series similar to that in (1)) 
allows one to take L as small as Ck2* for some absolute constant C; we omit the 
details. 
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